Welcome

Calais Marmoset is a simple yet powerful tool that makes it easy for you to generate and embed metadata in your content in preparation for Yahoo! Search's new open developer platform, SearchMonkey, as well as other metacrawlers and semantic applications.

In order to use Marmoset you must manage your own web site and be proficient in installing some simple PHP code templates.

If this doesn't sound like you - wait a little bit. The Marmoset capabilities will be deployed over the coming months via a variety of means that will require less technical knowledge to deploy.

Be sure to read the documentation and then download the Marmoset package.

Introduction

Marmoset provides a simple way to make your Web site's contents available through intelligent search.

Intelligent Search

Search engines allow filtering documents based on keywords that appear in a Web site. One word can refer to several different things. For example, when you search for 'Washington' using a search engine, results will likely relate to the city, the state and the person.

Intelligent search allows users to specify with greater precision what they are looking for. For example, a user may ask for results relating to 'Washington', but only when it refers to the state.

OpenCalais Semantic Analysis

The OpenCalais Web service can analyze text and provide rich semantic data to provided text. For example, when the word 'Washington' appears in the text, the OpenCalais Web service can determine whether it refers to the city, the state or the person.

Rich Content on Your Web Site

Using OpenCalais you can provide search engine crawlers with rich semantic data to consider when they index your page. Yahoo!'s search engine analyzes semantic data provided in Microformats, and other search engines are likely to follow.

Marmoset allows you to attach the Microformats data into dynamic Web pages on your site with no development effort on your part. As a result, users accessing your Web site through search engines will get better targeted results.

How It Works

In order to do that Marmoset identifies page requests made by search robots. For requests made by browsers your Web site content will be returned unchanged.

When a search robot is identified, Marmoset invokes the OpenCalais Web service and retrieves rich semantic data for the requested page. It then injects the resulting Microformats into the original Web page and returns the result to the search robot.

Search engines that analyze the Microformats can offer intelligent search for your Web site.

Example

Consider, for example, the following PHP page:

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
    <head>
        <title>Your Page Title</title>
    </head>
    <body>
        <div>Your Page Contents</div>
        <?php Your PHP Code ?>
    </body>
</html>

When a browser requests the page, it will be returned as is. However, when a search robot requests the page, the following result will be sent instead:

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
    <head>
        <title>Your Page Title</title>
    </head>
    <body>
        <div class="vcard">
            <span class="fn">A person from your page</span> 
              <div class="org">The person's organization</div> 
            <div class="title">The person's title</div> 
        </div>
        <a href="Link to organization" rel="tag">Organization's name</a>
        <div>Your Page Contents</div>
        <?php Your PHP Code ?>
    </body>
</html>

As you can see, semantic data is injected at the beginning of the HTML <body> element.