• daniskarma@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    0
    arrow-down
    1
    ·
    edit-2
    7 days ago

    Why would they request so many times a day the same data if the objective was AI model training. It makes zero sense.

    Also google bots obeys robots.txt so they are easy to manage.

    There may be tons of reasons google is crawling your website. From ad research to any kind of research. The only AI related use I can think of is RAG. But that would take some user requests aways because if the user got the info through the AI google response then they would not enter the website. I suppose that would suck for the website owner, but it won’t drastically increase the number of requests.

    But for training I don’t see it, there’s no need at all to keep constantly scraping the same web for model training.