1、UNSTRUCTURED DATA FORGENERATIVE AI AND AGENTSESSENTIAL GUIDETABLE OF CONTENTSIntroduction .3The New Data Foundation .4Getting Your Data Ready for AI .8Integrating Multimodality .11Building a Strong Data Foundation for AI .14Structuring Unstructured Data with the AI Data Cloud .19Table of Contents|2
2、THE ESSENTIAL GUIDE TO UNSTRUCTURED DATA FOR GENERATIVE AI AND AGENTSINTRODUCTIONWe live inside a sea of data.A recent report highlights how much in every minute of each day human beings are creating that data:Streaming363,000hoursofNetflixcontent Asking Siri 1 million questions Viewing 3.4 million
3、YouTube videos Performing5.9millionGooglesearches Sending18.8milliontextmessages Watching138.9millionFacebookandInstagramreels Composing 251 million emailsAnd thats just data delivered via the internet.It doesnt include all the data individuals and organizations create each day thats stored on perso
4、nal devices and network servers,or data used for monitoring systems and machine-to-machine communications.In 2024,the total volume of information generated by humans and machines was estimated to reach 147 zettabytes,or 147 trillion gigabytes.If you converted all that data to recorded speech,it woul
5、d generate more than 400 million years of continuous audio,or nearly four times the number of words ever spoken by human beings.This unfathomable amount of data is predicted to increase by another 2025%each year.By the end of 2025,global data production is expected to exceed 180 zettabytes.This data
6、 represents an enormous untapped opportunity for insights and innovation once we uncover the right ways to unlock it.This ebook will describe how organizations can prepare their unstructured data for use in training language models and building innovative AI applications.Introduction|3THE ESSENTIAL