Spaces:

amir22010
/

MarketMate

Sleeping

App Files Files Community

amir22010 commited on Oct 24, 2024

Commit

d351d70

verified ·

1 Parent(s): 632facd

Update app.py

Browse files

Files changed (1) hide show

app.py +37 -5

app.py CHANGED Viewed

@@ -103,17 +103,49 @@ async def greet(product,description):
         engine: groq
         model: llama-3.2-11b-text-preview
     rails:
       input:
         flows:
-          - self check input
-          - check blocked terms
     output:
         flows:
-          - self check output
-          - check blocked terms
-          - self check facts
     streaming: False
     """

         engine: groq
         model: llama-3.2-11b-text-preview
+      - type: llama_guard
+        engine: groq
+        model: llama-guard-3-8b
     rails:
       input:
         flows:
+          - content safety check input $model=main
     output:
         flows:
+          - content safety check output $model=llama_guard
+    prompts:
+      - task: content_safety_check_input $model=main
+        content: |
+          You are a policy expert trying to help determine whether a user prompt is in
+          violation of the defined safety policies.
+          <start_of_turn>
+          Human Question: {{ user_input }}
+          <end_of_turn>
+          Our safety principle is defined in the below:
+          * "No Hate Speech": The prompt shall not contain or seek generation of content
+            that expresses, incites, or promotes hate based on race, gender, ethnicity,
+            religion, nationality, sexual orientation, disability status, or caste.
+          * "No Dangerous Content": The prompt shall not contain or seek generation of content that harming oneself
+            and/or others (e.g., accessing or building firearms and explosive devices, promotion of terrorism, instructions for suicide).
+          * "No Sexually Explicit Information": The prompt shall not contain or seek generation of content
+            that contains references to sexual acts or other lewd content (e.g., sexually graphic descriptions,
+            content aimed at causing arousal). Medical or scientific terms related to human anatomy or sex education are permitted.
+          * "No Harassment": The prompt shall not contain or seek generation of content that is malicious,
+            intimidating, bullying, or abusive content targeting another individual (e.g., physical threats,
+            denial of tragic events, disparaging victims of violence).
+          ===
+          Does the human prompt/question violate the above principle? Your answer must start with
+          'Yes' or 'No'. And then walk through step by step to be sure we answer
+          correctly.
+        output_parser: is_content_safe
     streaming: False
     """