A Data Science Central Community
The Semantic Web is an idea of World Wide Web inventor Tim Berners-Lee where the web captures the semantics, or meaning, of data, and where machines are enabled to interact with that meta data. Berners-Lee observes that although search engines index much of the Web's content, keywords only provide an indirect association of the meaning that human actors are seeking. He foresees a number of ways in which developers and authors, singly or in collaborations, can use self-descriptions and other techniques so that context-understanding programs can selectively find what users want.
Tim Berners-Lee originally expressed the vision of the Semantic Web as follows:
I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web – the content, links, and transactions between people and computers. A ‘Semantic Web’, which should make this possible, has yet to emerge, but when it does, the day-to-day mechanisms of trade, bureaucracy and our daily lives will be handled by machines talking to machines. The intelligent agents people have touted for ages will finally materialize.
The world of semantic databases just got a little bit more interesting with the announcement by Franz, Inc. and Stillwater SC of having reached a trillion triple semantic data store for telecommunication data.
The database was constructed in an HPC on-demand cloud service and occupied 8 compute servers and 8 storage servers. The trillion triple data set spanned roughly 100TB of storage. It took roughly two weeks to load the data, but after that provides interactive query rates for knowledge discovery and data mining.