Tuesday 22 May 2012

Amazon CloudSearch-Information Retrieval as a Service

The idea of using computers to search for relevant pieces of information was popularized in the article “As We May Think” by Vannevar Bush in 1945.
As We May Think predicted (to some extent) many kinds of technology invented after its publication, including hypertext, personal computers, the Internet, the World Wide Web, speech recognition, and online encyclopedias such as Wikipedia: “Wholly new forms of encyclopedias will appear, ready-made with a mesh of associative trails running through them, ready to be dropped into the memex and there amplified.”
According to Wikipedia, Information retrieval (IR) is the area of study concerned with searching for documents, for information within documents, and for metadata about documents, as well as that of searching structured storage, relational databases, and the World Wide Web. There is overlap in the usage of the terms data retrieval, document retrieval, information retrieval, and text retrieval, but each also has its own body of literature, theory, praxis, and technologies.
Purpose of Information Retrieval: To find the desired content quickly and efficiently by simply consulting the index.
“News & Announcements” section in AWS Newsletter brings new surprise in terms of Amazon’s offering and the way they are expanding the domain every month. AWS users are not surprised about the surprise they are getting but its more in terms of what kind of offering will be targeted by Amazon remains surprise. In April 2012 AWS has come up with new offering that is Amazon CloudSearch.
Amazon CloudSearch offers a way to integrate search into websites and applications, whether they’re customer-facing or for use behind the corporate firewall. It’s the same search technology that’s available at Amazon.com.
Amazon CloudSearch is a fully-managed search service in the cloud that allows customers to easily integrate fast and highly scalable search functionality into their applications. Amazon CloudSearch effortlessly scales as the amount of searchable data increases or as the query rate changes, and developers can change search parameters, fine tune search relevance, and apply new settings at any time without having to upload the data again.
http://d36cz9buwru1tt.cloudfront.net/cloudsearch/CloudSearchScaling.png
http://d36cz9buwru1tt.cloudfront.net/cloudsearch/CloudSearchScaling.png
According to Amazon Web Services Blog,
“CloudSearch hides all of the complexity and all of the search infrastructure from you. You simply provide it with a set of documents and decide how you would like to incorporate search into your application.
You don’t have to write your own indexing, query parsing, query processing, results handling, or any of that other stuff. You don’t need to worry about running out of disk space or processing power, and you don’t need to keep rewriting your code to add more features.
With CloudSearch, you can focus on your application layer. You upload your documents, CloudSearch indexes them, and you can build a search experience that is custom-tailored to the needs of your customers.”

Architecture

Configuration Service: The configuration service enables you to create and configure search domains. Each domain encapsulates a collection of data you want to search.
  • Indexing Option specifies the field to include it is index
  • Text Options, to avoid words during indexing
  • Rank Expressions to determine how search results are ranked.
Document Service: to make changes to a domain’s searchable data.
Search Service: The search service handles search requests for a domain.

Features

Benefits

  • Offloads administrative burden of operating and scaling a search platform
  • No need to worry about hardware provisioning, data partitioning, or software patches; it will be taken care by service provider
  • Pay-as-you-go pricing with no up-front expenses

Pricing Dimensions

  • Search instances
  • Document batch uploads
  • Index Documents requests
  • Data transfer

Pricing

Search Instance Type
US East Region
Small Search Instance
$0.12 per hour
Large Search Instance
$0.48 per hour
Extra Large Search Instance
$0.68 per hour

Video Tutorials

Introducing Amazon CloudSearch
To see a summary of Amazon CloudSearch features, please watch this video.
Introducing Amazon CloudSearch
Building a Search Application Using Amazon CloudSearch
To see how to use Amazon CloudSearch to develop a search application, including uploading and indexing a large public data set, setting up index fields, customizing ranking, and embedding search in a sample application, please watch this video.
Building a Search Application Using Amazon CloudSearch


SOURCE : Amazon CloudSearch-Information Retrieval as a Service.


No comments:

Post a Comment