Invention Machine Blog

describe the image

Subscribe via E-mail

Your email:

Learn More

Take a Tour of Goldfire

Read the Case Studies

Request a Goldfire demo

Follow Us

describe the image  describe the image  describe the image  describe the image

Current Articles | RSS Feed RSS Feed

The Deep Web: Semantic Search Takes Innovation to New Depths

  
  
  

The Web is fast becoming a titanic, complex entity. By the year 2015, it’s estimated that one zettabyte of content will be added to the web each and every year.  Navigating this sea of information presents more and more of a challenge -- particularly when much of that content is not easily accessed by traditional search engines.

The Deep WebDeep Web

When most of us think of the Web, we think of the webpages – from online retailers, to government or organization-sponsored sites, to social media sites, news sites and more  - sites we access directly, via links or via common search engines like Google.  However, the scores of non-textual files (such as videos and images) and content stored in tables or databases far exceeds the ‘searchable’ content.

The ‘Surface Web’, also known as the ‘Visible’ or ‘Searchable’ web, while significantly large at over 8 billion pages, only cracks the surface when it comes to the size of the Internet.  The ‘Deep Web’, (also known as the ‘Invisible Web’, ‘Deepnet’, ‘DarkNet’, ‘Hidden Web’ and ‘Undernet’) refers to the content on the Internet that is not capable of being indexed by standard search engines, leaving the content ‘hidden’. The Deep Web houses over 96% of the content on the web that is publicly accessible.

According to CompletePlanet:

  • Public information on the Deep Web is currently 400 to 550 times larger than the commonly defined World Wide Web.
  • The Deep Web contains 7,500 terabytes of information, compared to 19 terabytes of information in the surface Web.
  • The Deep Web contains nearly 550 billion individual documents compared to the 1 billion of the surface Web.
  • More than an estimated 200,000 Deep Web sites presently exist. Sixty of the largest Deep Web sites collectively contain about 750 terabytes of information – sufficient by themselves to exceed the size of the surface Web by 40 times.

For content to be included as part of the surface web, web crawlers need to be able to find the content, which is done most commonly through links. The Deep Web, which contains some of the richest technical content on the Internet, therefore consists of items such as dynamic URLs, form-controlled entry pages, password-protected access pages, hidden pages, geo-tagged pages, content that is too new to have been indexed, and directories crawlers are told to exclude via robot exclusion files.

Examples of the types of content stored in directories and databases that make up the Deep Web include: patents, laws, ‘people finders’ such as lists of professionals like engineers and doctors, online catalogs, web stores, digital exhibits, multimedia and graphical files and more.  Typically organized around a particular field, the content tends to be very rich in engineering, scientific, technical, or domain specific knowledge generated over the years by specialized practitioners.

Tapping the Deep Web to Fuel Innovation

Imagine the type of knowledge and resources that could be leveraged if researchers were able to harness the information held within the remaining 96% of the Internet? The ability for access to this type of content is today one of the most important sources of competitive differentiation and advantage for companies.

Invention Machine is a leader in semantic research technology that unlocks decisions in data. Our patented semantic question-answering engine helps companies accelerate innovation, increase productivity and deliver superior products and services.

Invention Machine’s innovation intelligence platform, Goldfire, is powered by our world-class semantic question-answering technology and proven innovation tools and methods. 

With Goldfire, companies access over 3,300 of the richest Deep Web sites containing scientific and technical information from government, academic, commercial, and professional databases that cannot be accessed by conventional web searches. Also included in Goldfire is a semantic index of more than 5.6 million documents from over 1,750 of the best Deep Web sites and a special utility providing access methods to other Deep Web sites.

Goldfire’s patented semantic research capabilities understands the questions being asked, delivering relevant answers instead of simply producing the collection of keyword related hyperlinks of the typical search engine. Goldfire also has multi-lingual capabilities in English, French, German, and Japanese enabling researchers to retrieve answers in their native language despite the content being authored in languages they cannot understand.

When the power of Goldfire’s semantics and Deep Web searching are combined, it allows users to make sense of all of this unstructured, previously inaccessible information, providing precise answers to even the most challenging research questions, allowing workers to infuse knowledge into their research processes -- saving valuable time and money and increasing productivity.

Interested in learning more about Goldfire’s powerful Deep Web and semantic capabilities? Watch this overview video on semantics or see if Goldfire is right for you.

Request a demo

Comments

Thanks for writing this. I really feel as though I know so much more about this than I did before. Your blog really brought some things to light that I never would have thought about before reading it. You should continue this, Im sure most people would agree youve got a gift. 
That is the proper weblog for anyone who needs to search out out about this topic. You realize a lot its virtually arduous to argue with you (not that I actually would need…HaHa). You definitely put a new spin on a topic thats been written about for years. Nice stuff, just great!
Posted @ Thursday, June 28, 2012 4:24 AM by sous vetement pas cher femme
You have caught my interest as well. Compelling metaphor you have there. Thanks, I will be back to check the blog.
Posted @ Thursday, September 20, 2012 4:30 PM by Mary Guillemette
I've had issues attempting similar symantic searches in the past. Are you saying that you're opening up all foreign language searches? 
 
Do you perform searches of patent databases also? 
 
Thanks
Posted @ Friday, January 18, 2013 8:09 AM by London
Simply thought which i would inform you that BeyondMegapixels will be a welcomed addition for your list! 
Posted @ Tuesday, April 02, 2013 8:25 AM by photographymagazines
Low-fat milk products are vital that you ensure you're getting sufficient calcium with regard to strong bone fragments. Use milk inside your oatmeal or together with your whole-grain cereal with regard to breakfast. Consume yogurt, slices associated with cheese or even cottage cheese like a snack. Sprinkle low-fat cheese in your salad or even drink the glass associated with 2 % milk together with your meal. 
Posted @ Tuesday, April 02, 2013 8:25 AM by thenutritiontips
What's safe on the internet shopping? How much money spent with online stores is growing at a superb rate because of the convenience, choice as well as low prices that may be found. This is actually all excellent but how can you know that you're spending having a reputable web site? It is essential to know very well what safe on the internet shopping is actually, what to consider and things to avoid. 
Posted @ Tuesday, April 02, 2013 8:25 AM by buyonlineguru
Indeed!!! I experienced this precise feeling within Nice – Southern of Portugal. I adore the older, lived-in feel of the European city for example Lisbon – this looks therefore beautiful and filled with stories! 
Posted @ Tuesday, April 02, 2013 8:26 AM by cabinsinontario
Men's stores offer man shoppers an individual and enjoyable shopping encounter that suits their requirements and pursuits. It is actually comfortable to allow them to shop with regard to items within an establishment that's filled with individuals who share their own interests as well as passions. 
Posted @ Tuesday, April 02, 2013 8:26 AM by theonlinefashionshopping
Bankruptcy is a big step, and one that should be taken only after careful consideration and with the guidance of an experienced bankruptcy attorney.
Posted @ Friday, April 12, 2013 1:51 AM by bankruptcy law lexington ky
The quickest way to get wet is with an Aurora Pool and Spa above ground swimming pool. 
Posted @ Wednesday, May 08, 2013 2:50 AM by aurorapoolsandspas
Break Security has been working in this field for a number of years. Both phones and tablets can be scoured by Break Security to locate any major security flaws 
Posted @ Wednesday, May 15, 2013 5:31 AM by Penetration Testing
By state law, there is typically a small fee that you must pay before a public record can be released and divorce is one of the categories of such public information.
Posted @ Thursday, May 16, 2013 12:30 AM by law-states.com
Some people come to our life and take something leaving our hear in pain. Anyway, I think there's a huge difference on "Surface Web" & "Deep Web"
Posted @ Thursday, May 23, 2013 11:11 PM by remote control helicopters with camera
Your post is very helpful and remarkable, I'm typically to blogging and i actually value your content.I really appreciate this post. I been looking all over for this! Thank goodness I found it on Bing. You have made my day! Thank you again, 
Posted @ Friday, May 24, 2013 8:17 AM by free sports dissertation topics
Post Comment
Name
 *
Email
 *
Website (optional)
Comment
 *

Allowed tags: <a> link, <b> bold, <i> italics