You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: clawhub/SKILL.md
+14-7Lines changed: 14 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,10 @@
1
1
# Data Quality Evaluation with Dingo
2
2
3
-
Evaluate AI training data, SFT datasets, RAG pipelines, and OCR documents using Dingo — a comprehensive data quality assessment framework with 70+ built-in metrics.
3
+
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool.
4
+
5
+
***GitHub**: https://github.com/MigoXLab/dingo
6
+
***SaaS Platform**: https://dingo.openxlab.org.cn/ (free, no install needed)
7
+
***PyPI**: https://pypi.org/project/dingo-python/
4
8
5
9
## Installation
6
10
@@ -313,12 +317,6 @@ After evaluation completes, the agent should:
313
317
2. If there are failures, briefly explain what each failing metric means
314
318
3. Suggest next steps (e.g., "15% of items have colon-ending issues — you may want to clean those")
315
319
316
-
## SaaS platform
317
-
318
-
For users who prefer a web interface, Dingo SaaS is available at: https://dingo.openxlab.org.cn/
319
-
320
-
No local installation needed — upload data, configure evaluators, and view reports in the browser.
321
-
322
320
## Troubleshooting
323
321
324
322
***`ModuleNotFoundError: No module named 'dingo'`**: Run `pip install dingo-python` (note: the package name is `dingo-python`, not `dingo`)
@@ -333,3 +331,12 @@ No local installation needed — upload data, configure evaluators, and view rep
333
331
* Rule-based evaluators run locally with zero API cost
334
332
* Results are written to the `outputs/` directory by default (timestamped subdirectories)
335
333
* The `content` field is the most commonly mapped field — it's the main text that most evaluators check
Copy file name to clipboardExpand all lines: clawhub/_meta.json
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
{
2
2
"name": "dingo-data-quality",
3
3
"version": "1.0.0",
4
-
"description": "Evaluate AI training data quality using Dingo — 70+ built-in metrics for text, RAG, OCR, and multimodal datasets. Rule-based checks (free, fast) and LLM-as-a-Judge evaluation with any OpenAI-compatible API.",
4
+
"description": "Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool.",
0 commit comments