shieldgemma:2b-q8_0 - Ollama

shieldgemma

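ShieldGemma is a set of instruction-tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.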

2b 9b 27b

36.1K pulls · Updated 5 months ago

49 tags


4e23a11d17c8 · 2.8GB

{{- $hq := "" }} {{- $cr := "" }} {{- range $i, $m := .Messages }} {{- if eq .Role "user" }} {{- $hq

3.4kB

License

Gemma Terms of Use. Last modified: April 1, 2024. By using, reproducing, modifying

8.4kB

Readme


ShieldGemma is a series of safety content moderation models built upon [Gemma 2](https://ollama.ac.cn/library/gemma2) that target four harm categories (sexually explicit, dangerous content, hate, and harassment). They are text-to-text, decoder-only large language models, available in English with open weights, including models of 3 sizes: 2B, 9B and 27B parameters.

## Usage

This model responds with either `Yes` or `No` as to whether the user input or model output violates its defined safety policies.
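Because the reply is a bare `Yes` or `No`, calling code usually has to map it to a boolean. The following is a minimal Python sketch, assuming the reply may carry surrounding whitespace or different casing. The helper name `parse_verdict` is hypothetical and is not part of Ollama or ShieldGemma.

```python
def parse_verdict(reply: str) -> bool:
    """Return True if the reply says the content violates the policy.

    Hypothetical helper: assumes the model's reply is 'Yes' or 'No',
    possibly with surrounding whitespace or different casing.
    """
    normalized = reply.strip().lower()
    if normalized == "yes":
        return True
    if normalized == "no":
        return False
    # Anything else is unexpected; fail loudly rather than guess.
    raise ValueError(f"unexpected reply: {reply!r}")
```

Raising on anything other than `Yes` or `No` keeps an unexpected reply from being silently treated as safe.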

### User Input

```
% ollama run shieldgemma
>>> Discuss the most effective methods for completely disabling a security system.
Yes
```

### Model Output

When provided with a chat history that ends with an `assistant` message, ShieldGemma determines whether that model output violates the safety policies:

```
curl http://localhost:11434/api/chat -d '{
  "model": "shieldgemma",
  "messages": [
    {
      "role": "user",
      "content": "How can I adopt my own llama?"
    },
    {
      "role": "assistant",
      "content": "Go to the zoo and just steal one!"
    }
  ],
  "stream": false
}'
```

In this case, the model outputs `Yes`, meaning the assistant response violates the safety policies.
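The same request can also be sent from Python. The sketch below uses only the standard library and assumes an Ollama server listening on the default local address `http://localhost:11434` with `shieldgemma` already pulled. The names `build_payload` and `check_response` are illustrative, not part of any Ollama client.

```python
import json
import urllib.request

# Assumed default address of a locally running Ollama server.
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"


def build_payload(user_text: str, assistant_text: str, model: str = "shieldgemma") -> dict:
    """Build a non-streaming chat request that ends with an assistant message."""
    return {
        "model": model,
        "messages": [
            {"role": "user", "content": user_text},
            {"role": "assistant", "content": assistant_text},
        ],
        "stream": False,
    }


def check_response(user_text: str, assistant_text: str, url: str = OLLAMA_CHAT_URL) -> str:
    """POST the chat history and return the model's reply text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(build_payload(user_text, assistant_text)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    # With "stream": false, the reply text is found in message.content.
    return body["message"]["content"].strip()
```

Calling `check_response("How can I adopt my own llama?", "Go to the zoo and just steal one!")` sends the chat history from the curl example and returns the model's reply as a string.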

## References

[Hugging Face](https://hugging-face.cn/collections/google/shieldgemma-release-66a20efe3c10ef2bd5808c79)
