Simple Format
Description
Calais now supports a simple output format that provides a snapshot of the entire RDF content and facilitates the parsing and integration of the results.
The simple output includes all Entities and a subset of Events and Facts. The following tables summarize the metadata elements included in the simple output format. Entities are shown before Events and Facts.
| Metadata Element - Entities | Simple Output Value |
| Anniversary | value of attribute Anniversary |
| City | value of attribute City |
| Company | value of attribute Company |
| Continent | value of attribute Continent |
| Country | value of attribute Country |
| Currency | value of attribute Currency |
| EmailAddress | value of attribute EmailAddress |
| EntertainmentAwardEvent | value of attribute EntertainmentAwardEvent |
| Facility | value of attribute Facility |
| FaxNumber | value of attribute FaxNumber |
| Holiday | value of attribute Holiday |
| IndustryTerm | value of attribute IndustryTerm |
| MarketIndex | value of attribute MarketIndex |
| MedicalCondition | value of attribute MedicalCondition |
| MedicalTreatment | value of attribute MedicalTreatment |
| Movie | value of attribute Movie |
| MusicAlbum | value of attribute MusicAlbum |
| MusicGroup | value of attribute MusicGroup |
| NaturalDisaster | value of attribute NaturalDisaster |
| NaturalFeature | value of attribute NaturalFeature |
| OperatingSystem | value of attribute OperatingSystem |
| Organization | value of attribute Organization |
| Person | value of attribute Person |
| PhoneNumber | value of attribute PhoneNumber |
| Product | value of attribute Product |
| ProgrammingLanguage | value of attribute ProgrammingLanguage |
| ProvinceOrState | value of attribute ProvinceOrState |
| PublishedMedium | value of attribute PublishedMedium |
| RadioProgram | value of attribute RadioProgram |
| RadioStation | value of attribute RadioStation |
| Region | value of attribute Region |
| SportsEvent | value of attribute SportsEvent |
| SportsGame | value of attribute SportsGame |
| SportsLeague | value of attribute SportsLeague |
| TVShow | value of attribute TVShow |
| TVStation | value of attribute TVStation |
| Technology | value of attribute Technology |
| URL | value of attribute URL |
| Metadata Element - Events | Simple Output Value |
| Acquisition | "M&A" |
| Alliance | "Business Partnership" |
| AnalystEarningsEstimate | "Earnings Estimate" |
| AnalystRecommendation | "Analyst Recommendation" |
| Bankruptcy | "Bankruptcy" |
| BonusShares | "Bonus Shares" |
| BusinessRelation | "Business Partnership" |
| Buybacks | "Security Buyback" |
| CompanyEarningsAnnouncement | "Earnings Announcement" |
| CompanyEarningsGuidance | "Earnings Guidance" |
| CompanyInvestment | "Funding" |
| CompanyLegalIssues | "Legal Issues" |
| CompanyMeeting | "General or Shareholder Meeting" |
| CompanyReorganization | "Reorganization" |
| ConferenceCall | "Conference Call" |
| CreditRating | "Credit Rating" |
| IPO | "IPO" |
| JointVenture | "Business Partnership" |
| ManagementChange | "Management Change" |
| Merger | "M&A" |
| MovieRelease | "Movie Release" |
| MusicAlbumRelease | "Music Album Release" |
| PersonTravel | "Person Travel" |
| PersonCommunication | "Person Communication and Meetings" |
| StockSplit | "Stock Split" |
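The Events table maps several distinct event types to the same simple output value (for example, Acquisition and Merger both appear as "M&A"). A minimal Python sketch of that lookup follows. It covers only a subset of the rows above, and the names `EVENT_LABELS`, `simple_event_label`, and `event_types_for` are hypothetical helpers chosen for illustration, not part of the Calais service.

```python
# Subset of the Events table above: event type -> Simple Format value.
# The dictionary and helper names are illustrative assumptions.
EVENT_LABELS = {
    "Acquisition": "M&A",
    "Merger": "M&A",
    "Alliance": "Business Partnership",
    "BusinessRelation": "Business Partnership",
    "JointVenture": "Business Partnership",
    "CompanyMeeting": "General or Shareholder Meeting",
    "CompanyInvestment": "Funding",
    "Buybacks": "Security Buyback",
}


def simple_event_label(event_type):
    """Return the simple output value for an event type, or None if unknown here."""
    return EVENT_LABELS.get(event_type)


def event_types_for(label):
    """Return every event type in this subset that collapses to the given value."""
    return sorted(t for t, value in EVENT_LABELS.items() if value == label)


print(simple_event_label("Merger"))
print(event_types_for("Business Partnership"))
```

Because the mapping is many-to-one, the original event type cannot be recovered from the simple output value alone.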
Information from Company and Geography disambiguation is integrated into the Simple Format output. Each resolved name is added as the value of the "normalized" attribute of the main entity (Company, City, ProvinceOrState, Country). For unresolved names, the "normalized" attribute and other information (such as geographical coordinates, ticker, etc.) are not shown in the Simple Format output.
Examples:
<City count="1" relevance="0.147"
normalized="Golden,Colorado,United States">Golden</City>
<Company count="3" relevance="0.519"
normalized="Google Inc.">Google</Company>
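As a rough illustration of how a client might read these attributes, the Python sketch below parses a single entity element with the standard library. The function name `read_entity` is hypothetical, and splitting the City value on commas assumes the city, province or state, country ordering seen in the example above.

```python
import xml.etree.ElementTree as ET


def read_entity(xml_fragment):
    """Parse one Simple Format entity element into a plain dictionary."""
    elem = ET.fromstring(xml_fragment)
    relevance = elem.get("relevance")
    return {
        "type": elem.tag,
        "name": elem.text,
        "count": int(elem.get("count")),
        "relevance": float(relevance) if relevance is not None else None,
        # "normalized" is absent for unresolved names, so this may be None.
        "normalized": elem.get("normalized"),
    }


city = read_entity(
    '<City count="1" relevance="0.147" '
    'normalized="Golden,Colorado,United States">Golden</City>'
)
# Assumption: the normalized City value is "city,province or state,country".
print(city["normalized"].split(","))

unresolved = read_entity('<Company count="1" relevance="0.297">XSTV Corporation</Company>')
print(unresolved["normalized"])
```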
Simple Format results are returned in two forms:
- XML format: Entities and Events are given as an XML block. Each unique name is a separate XML node that also includes the count (frequency) of the name in the given input content.
- Semi-structured text format: Entities and Events are given in a semi-structured textual block (appears as an XML comment). The delimiting character is \t (tab).
Examples:
Text Input:
Tensleep Corporation (Other OTC:TENS.PK - News) ("Tensleep") announced that with the acquisition of XSTV Media, Inc. ("XSTV"),
it will become an online independent sports company. The transaction is
to close on or before September 15, 2007. Tensleep will, by the end of
this week or early next week, call a special meeting of shareholders to
approve the change name to "XSTV Corporation."
David Bailey, an analyst at Gerard Klauer Mattison, said such cuts "could include head count reductions."
Layoffs to some degree are inevitable, said IDC analyst Roger Kay. For years,
the company enjoyed a lower cost structure than other PC makers because
it sold computers directly.
International Star Inc. (OTC BB: ILST) announced that the annual meeting of
shareholders of International Star Inc. will be held on May 19, 2008,
at 3:00 p.m. (local time) at The Hilton Hotel, 104 Market Street,
Shreveport, La., 71101.
Simple Output Format:
<!--
Use of the Calais Web Service is governed by the Terms of Service located at http://www.opencalais.com. By using this service or the results of the service you agree to these terms of service.
-->
<!--City: Shreveport,
Company: Tensleep Corporation, International Star Inc., XSTV Media Inc., XSTV Corporation, Gerard Klauer Mattison, IDC,
Event: General or Shareholder Meeting, M&A,
Facility: The Hilton Hotel,
IndustryTerm: sports,
Person: David Bailey, Roger Kay,
--><OpenCalaisSimple>
<Description>
<allowDistribution>true</allowDistribution>
<allowSearch>true</allowSearch>
<calaisRequestID>b5452ae4-1d6f-4012-a644-3662d347ccb6</calaisRequestID>
<externalID></externalID>
<id>http://id.opencalais.com/nhhvv9ffTGi*sD7Dk5j83Q</id>
<about>http://d.opencalais.com/dochash-1/c0805f22-d2d4-390e-ad40-42e903cd60cf</about>
</Description>
<CalaisSimpleOutputFormat>
<Company count="4" relevance="0.406" normalized="Epic Corporation">Tensleep Corporation</Company>
<Company count="3" relevance="0.123" normalized="International Star, Inc.">International Star Inc.</Company>
<Company count="2" relevance="0.344" normalized="XSTV Media Inc">XSTV Media Inc.</Company>
<Event count="2">General or Shareholder Meeting</Event>
<City count="1" relevance="0.089" normalized="Shreveport,Louisiana,United States">Shreveport</City>
<Company count="1" relevance="0.297">XSTV Corporation</Company>
<Company count="1" relevance="0.224" normalized="Harris Nesbitt Gerard, Inc">Gerard Klauer Mattison</Company>
<Company count="1" relevance="0.146" normalized="Investment Development Co">IDC</Company>
<Facility count="1" relevance="0.089">The Hilton Hotel</Facility>
<IndustryTerm count="1" relevance="0.327">sports</IndustryTerm>
<Person count="1" relevance="0.224">David Bailey</Person>
<Person count="1" relevance="0.146">Roger Kay</Person>
<Event count="1">M&amp;A</Event>
<Topics>
<Topic Score="1.000" Taxonomy="Calais">Business_Finance</Topic>
</Topics>
</CalaisSimpleOutputFormat>
</OpenCalaisSimple>
Simple Format Schema
<!-- Comment -->
<OpenCalaisSimple>
<Description>
-- transaction details
</Description>
<CalaisSimpleOutputFormat>
-- list of entities in the form:
<entity name count="occurrences count" relevance="score value">entity value</entity name>
-- list of events in the form:
<Event count="occurrences count">event name</Event>
-- list of topics in the form:
<Topic Score="score value" Taxonomy="Calais">topic name</Topic>
</CalaisSimpleOutputFormat>
</OpenCalaisSimple>
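The schema can be walked with any standard XML parser. The following Python sketch is one possible way to group the nodes by element name. It uses a shortened copy of the example output, the helper name `summarize` is hypothetical, and it assumes the ampersand in "M&A" arrives escaped as `&amp;` and that topics sit inside a `Topics` wrapper as in the example output.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Shortened copy of the example output; the leading comment is ignored by the parser.
SAMPLE = """<!-- terms of service comment -->
<OpenCalaisSimple>
  <Description>
    <calaisRequestID>b5452ae4-1d6f-4012-a644-3662d347ccb6</calaisRequestID>
  </Description>
  <CalaisSimpleOutputFormat>
    <Company count="4" relevance="0.406" normalized="Epic Corporation">Tensleep Corporation</Company>
    <City count="1" relevance="0.089" normalized="Shreveport,Louisiana,United States">Shreveport</City>
    <Event count="2">General or Shareholder Meeting</Event>
    <Event count="1">M&amp;A</Event>
    <Topics>
      <Topic Score="1.000" Taxonomy="Calais">Business_Finance</Topic>
    </Topics>
  </CalaisSimpleOutputFormat>
</OpenCalaisSimple>"""


def summarize(simple_xml):
    """Group entity and event nodes by element name; collect topic names separately."""
    root = ET.fromstring(simple_xml)
    body = root.find("CalaisSimpleOutputFormat")
    by_type = defaultdict(list)
    topics = []
    for node in body:
        if node.tag == "Topics":
            topics = [topic.text for topic in node.findall("Topic")]
        else:
            # Each unique name is one node carrying its occurrence count.
            by_type[node.tag].append((node.text, int(node.get("count"))))
    return dict(by_type), topics


entities, topics = summarize(SAMPLE)
for element_name, values in entities.items():
    print(element_name, values)
print("Topics:", topics)
```

Grouping by element name keeps the code independent of the entity list, so element names added to the tables later would be picked up without changes.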
