Towards high-quality (maybe synthetic) datasets

Released Wednesday, 9th October 2024

As Argilla puts it: "Data quality is what makes or breaks AI." But what exactly does this mean, and how can AI teams properly collaborate with domain experts to improve data quality? David Berenstein & Ben Burtenshaw, who are building Argilla & Distilabel at Hugging Face, join us to dig into these topics, along with synthetic data generation and AI-generated labeling and feedback.
