14 C
Munich
星期四, 2 7 月, 2026

The emergence of the web data infrastructure layer for AI

Must read

Wacky Wimbledon menu features strawberry sushi and duck slathered in Pimm’s

Wimbledon fans can now get even more involved with all the tennis action as one London location has launched a menu centred around the...

SpaceX may build more than just a network to outshine AT&T, T-Mobile, and Verizon

SpaceX may be about to become a threat to Android and Apple as well. #SpaceX #build #network #outshine #ATampT #TMobile #Verizon

Woman left with 38 brain parasites years after foreign trip and passing massive tapeworm

A Welsh woman who backpacked around India in 2007 says she later suffered seizures and a mental health crisis after doctors found 38 parasitic...

T-Mobile customers using very old devices will lose connectivity next month

T-Mobile is shutting down 2G. #TMobile #customers #devices #lose #connectivity #month

The next frontier in AI may depend on a new web data infrastructure layer that can enable models to discover and map this ever-expanding digital realm. This layer must be able to navigate hundreds of millions of existing web domains and billions of new URLs created each week, delivering real-time information and overcoming technical barriers.

“The data suggests there’s far more data out there,” says Or Lenchner, CEO of Bright Data, a web data collection platform. “Think of the universe: It’s out there, but you don’t know what you don’t know.”

Enabling access to fresh, relevant, and trustworthy data

While early AI breakthroughs were driven by scaling training data and model size, organizations are now encountering a fundamental bottleneck: They need to keep pace with the dynamic, unstructured, and constantly evolving nature of web data in order to ground outputs in current and verifiable information. AI performance increasingly depends not just on model architecture but on a system’s compute, networking, retrieval, and data engineering capabilities—that is, the system’s ability to quickly and reliably retrieve data that is fresh, relevant, and trustworthy.

Traditional model training relies on snapshots of information collected at a particular point in time. Training AI on such static data is no longer sufficient. To track fluctuations such as competitor pricing, consumer sentiment, and market trends, companies need a constant feed of new information, pulling data in real time along with relevant context. Their infrastructure must therefore be able to handle millions of simultaneous interactions across websites that vary by geography, language, format, and access rules.

“If it can’t retrieve real-time information, it lacks context,” Lenchner says. “In a business setting, that’s not acceptable anymore. Stale answers lead to bad decisions and disappointed consumers.”

Speed is not merely a matter of convenience; it’s a matter of necessity. Today’s organizations operate in environments where prices, inventory, markets, security threats, and customer behavior change continuously. Delayed data retrieval can reduce the usefulness of an otherwise sophisticated model.

Using live, high-quality web data can also reduce AI hallucinations because the model has a more relevant knowledge base. This builds user trust. In fact, one survey found that 56% of AI practitioners said businesses need access to real-time web data to improve trust in AI outputs. To ensure the model runs efficiently and effectively, the information must also be pared down to the appropriate essentials. 

Despite the introduction of retrieval-augmented generation (RAG), where models pull in external data at the moment of a query, many AI systems still struggle to deliver outputs that are current, contextually relevant, and trustworthy in operational settings. According to Gartner, 60% of AI projects that are not supported by AI-ready data—accurate, structured, organized, and contextualized—will be abandoned by the end of the year. 

#emergence #web #data #infrastructure #layer

- Advertisement -

More articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisement -

Latest article

Wacky Wimbledon menu features strawberry sushi and duck slathered in Pimm’s

Wimbledon fans can now get even more involved with all the tennis action as one London location has launched a menu centred around the...

SpaceX may build more than just a network to outshine AT&T, T-Mobile, and Verizon

SpaceX may be about to become a threat to Android and Apple as well. #SpaceX #build #network #outshine #ATampT #TMobile #Verizon

Woman left with 38 brain parasites years after foreign trip and passing massive tapeworm

A Welsh woman who backpacked around India in 2007 says she later suffered seizures and a mental health crisis after doctors found 38 parasitic...

T-Mobile customers using very old devices will lose connectivity next month

T-Mobile is shutting down 2G. #TMobile #customers #devices #lose #connectivity #month

Boy, 16, fighting for life after Birmingham mosque shooting

A 16-year-old boy was rushed to hospital with potentially life-threatening injuries after he was shot near a mosque in BirminghamJoe Smith News Reporter and...