AI Companies Running Out of Training Data After Burning Through Entire Internet - eviltoast
  • voidx@futurology.todayOPM
    link
    fedilink
    English
    arrow-up
    5
    ·
    8 months ago

    They seem to be experimenting with that for sure, but need to ensure quality of the model doesn’t degrade, as per source article:

    Anthropic’s chief scientist, Jared Kaplan, said some types of synthetic data can be helpful. Anthropic said it used “data we generate internally” to inform its latest versions of its Claude models. OpenAI also is exploring synthetic data generation, the spokeswoman said.