Optimize language model performance and alignment with high-quality preference data across multiple domains.
Description | Domain | Modality | # of Experts | Duration | Accuracy |
---|---|---|---|---|---|
Multi-criteria ranking of model coding outputs across coding languages (Java, SQL, JavaScript, Python, Go, C/C++) | Coding | Text | 20 | 8 months | 92% (90% req.) |
Model response scoring, categorization and annotation across domains (math, roleplay, language, education) | Multi-domain | Text | 40 | 15 months | 92% (90% req.) |
Multi-criteria scoring of language translation outputs | Language | Text | 20 | 1 month | 90% (90% req.) |
Multi-criteria ranking of model generated videos (realism, motion dynamics, consistency, alignment) | Art | Multi-modal | 25 | 1 month | 90% (90% req.) |
Platform content scoring for model safety Improvement | Language | Text | 15 | 6 months | 99% (98% req.) |