mashdigi-Technology, new products, interesting news, trends

OpenAI launches web crawler technology called GPTBot to collect public data available for artificial intelligence training in a more transparent way

Author: Mash Yang
2023-08-08
in App, Market dynamics, Life, network, software

To address the privacy and copyright disputes surrounding data extracted from the public web, OpenAI announced the launch of a web crawler called GPTBot, which will collect the data needed for artificial intelligence training in a more transparent way.


OpenAI said that GPTBot identifies itself with a user-agent token and a full user-agent string, so website operators can recognize the crawler. The public web content it crawls will be used only to improve future artificial intelligence models, and content that requires payment will be excluded.

If a website operator does not want GPTBot to crawl its content, for example because the pages contain a large amount of personal information, they only need to add a rule for the "GPTBot" user agent to the site's robots.txt file, either blocking the crawler entirely or specifying which content it may crawl. OpenAI also provides a way to block GPTBot directly by restricting access from the crawler's IP address ranges.
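
As a rough illustration of the two opt-out mechanisms described above, the following Python sketch checks hypothetical robots.txt rules for the "GPTBot" user agent using the standard library's `urllib.robotparser`, and tests an address against a blocked IP range using `ipaddress`. The robots.txt contents, the paths, the `example.com` URLs, and the helper names `can_crawl` and `is_blocked_ip` are assumptions made for this example. The IP range shown is a documentation placeholder, not a real GPTBot range; the actual ranges would have to come from OpenAI.

```python
from ipaddress import ip_address, ip_network
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks GPTBot from the whole site.
BLOCK_ALL = """\
User-agent: GPTBot
Disallow: /
"""

# Hypothetical robots.txt that lets GPTBot crawl /articles/ but not /members/.
PARTIAL = """\
User-agent: GPTBot
Allow: /articles/
Disallow: /members/
"""

# Placeholder range from the documentation address space (RFC 5737).
# This is NOT a real GPTBot range; substitute the ranges OpenAI publishes.
EXAMPLE_BLOCKED_RANGES = [ip_network("192.0.2.0/24")]


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def is_blocked_ip(address: str) -> bool:
    """Return True if address falls inside any of the example blocked ranges."""
    ip = ip_address(address)
    return any(ip in network for network in EXAMPLE_BLOCKED_RANGES)
```

Calling `can_crawl(BLOCK_ALL, "GPTBot", "https://example.com/news/1")` evaluates the rules the way a compliant crawler would, while `is_blocked_ip` mirrors the kind of check a firewall or web server rule would perform. robots.txt relies on the crawler honoring the file, whereas IP filtering is enforced on the server side.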

Many websites have long used such settings to keep search engines from crawling their pages. As artificial intelligence (AI) technology continues to grow, more and more AI training relies on large amounts of public data. This has heightened concern among website operators that their content will be used for AI training, at the expense of valuable data or user privacy, and they are now asking AI technology providers to access web data in a reasonable manner.

Tags: AI, GPTBot, OpenAI, Artificial wisdom, crawler

Mash Yang

Founder and editor of mashdigi.com, and student of technology journalism.



Copyright © 2017 mashdigi.com
