Simple Format
Description
OpenCalais now supports a simple output format that provides a snapshot of the entire RDF content and facilitates the parsing and integration of the results.
The simple output includes all Entities and a subset of Events and Facts. The following table summarizes the metadata elements included in the simple output format. Entities are shown before Events and Facts.
| Metadata Element - Entities | Simple Output Value |
| Anniversary | value of attribute Anniversary |
| City | value of attribute City |
| Company | value of attribute Company |
| Continent | value of attribute Continent |
| Country | value of attribute Country |
| Currency | value of attribute Currency |
| EmailAddress | value of attribute EmailAddress |
| EntertainmentAwardEvent | value of attribute EntertainmentAwardEvent |
| Facility | value of attribute Facility |
| FaxNumber | value of attribute FaxNumber |
| Holiday | value of attribute Holiday |
| IndustryTerm | value of attribute IndustryTerm |
| MarketIndex | value of attribute MarketIndex |
| MedicalCondition | value of attribute MedicalCondition |
| MedicalTreatment | value of attribute MedicalTreatment |
| Movie | value of attribute Movie |
| MusicAlbum | value of attribute MusicAlbum |
| MusicGroup | value of attribute MusicGroup |
| NaturalFeature | value of attribute NaturalFeature |
| OperatingSystem | value of attribute OperatingSystem |
| Organization | value of attribute Organization |
| Person | value of attribute Person |
| PhoneNumber | value of attribute PhoneNumber |
| PoliticalEvent | value of attribute PoliticalEvent |
| Product | value of attribute Product |
| ProgrammingLanguage | value of attribute ProgrammingLanguage |
| ProvinceOrState | value of attribute ProvinceOrState |
| PublishedMedium | value of attribute PublishedMedium |
| RadioProgram | value of attribute RadioProgram |
| RadioStation | value of attribute RadioStation |
| Region | value of attribute Region |
| SportsEvent | value of attribute SportsEvent |
| SportsGame | value of attribute SportsGame |
| SportsLeague | value of attribute SportsLeague |
| TVShow | value of attribute TVShow |
| TVStation | value of attribute TVStation |
| Technology | value of attribute Technology |
| URL | value of attribute URL |
| Metadata Element - Events | Simple Output Value |
| Acquisition | "M&A" |
| Alliance | "Business Partnership" |
| AnalystEarningsEstimate | "Earnings Estimate" |
| AnalystRecommendation | "Analyst Recommendation" |
| Arrest | "Judicial Event" |
| Bankruptcy | "Bankruptcy" |
| BonusSharesIssuance | "Bonus Shares Issuance" |
| BusinessRelation | "Business Partnership" |
| Buybacks | "Security Buyback" |
| CompanyAccountingChange | "Accounting Change" |
| CompanyEarningsAnnouncement | "Earnings Announcement" |
| CompanyEarningsGuidance | "Earnings Guidance" |
| CompanyExpansion | "Company Expansion" |
| CompanyForceMajeure | "Force Majeure" |
| CompanyInvestment | "Funding" |
| CompanyLaborIssues | "Labor Issues" |
| CompanyLayoffs | "Layoffs" |
| CompanyLegalIssues | "Legal Issues" |
| CompanyListingChange | "Listing Change" |
| CompanyMeeting | "General or Shareholder Meeting" |
| CompanyNameChange | "Name Change" |
| CompanyReorganization | "Reorganization" |
| CompanyRestatement | "Financials Restatement" |
| ConferenceCall | "Conference Call" |
| Conviction | "Judicial Event" |
| CreditRating | "Credit Rating" |
| DebtFinancing | "Debt Financing" |
| DelayedFiling | "Delayed Filing" |
| DiplomaticRelations | "Diplomatic Relations" |
| Dividend | "Dividend" |
| EmploymentChange | "Employment Change" |
| EnvironmentalIssue | "Environmental Issues" |
| Extinction | "Extinction" |
| FDAPhase | "FDA Phase" |
| Indictment | "Judicial Event" |
| IPO | "IPO" |
| JointVenture | "Business Partnership" |
| ManMadeDisaster | "Man-Made Disaster" |
| Merger | "M&A" |
| MovieRelease | "Movie Release" |
| MusicAlbumRelease | "Music Album Release" |
| NaturalDisaster | "Natural Disaster" |
| PatentFiling | "Patent Filing" |
| PatentIssuance | "Patent Issuance" |
| PersonTravel | "Person Travel" |
| PoliticalEndorsement | "Political Endorsement" |
| PoliticalRelationship | "Political Relationship" |
| PollsResult | "Polls Result" |
| ProductIssues | "Product Issues" |
| ProductRecall | "Product Recall" |
| ProductRelease | "Product Release" |
| SocialTags | Value of each identified Social Tag |
| SecondaryIssuance | "Second Stock Issuance" |
| PersonCommunication | "Person Communication and Meetings" |
| StockSplit | "Stock Split" |
| Trial | "Judicial Event" |
| VotingResult | "Voting Result" |
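Several Events in the table above share one simple output value (for example, Arrest, Conviction, Indictment and Trial all appear as "Judicial Event"), so the original Event type cannot always be recovered from the simple output alone. The following Python sketch illustrates this with a subset of the table; the names `EVENT_TO_SIMPLE` and `invert` are hypothetical and not part of any OpenCalais API.

```python
from collections import defaultdict

# A subset of the Events table above: metadata element -> simple output value.
EVENT_TO_SIMPLE = {
    "Acquisition": "M&A",
    "Merger": "M&A",
    "Alliance": "Business Partnership",
    "BusinessRelation": "Business Partnership",
    "JointVenture": "Business Partnership",
    "Arrest": "Judicial Event",
    "Conviction": "Judicial Event",
    "Indictment": "Judicial Event",
    "Trial": "Judicial Event",
    "IPO": "IPO",
}


def invert(mapping):
    """Group metadata elements by the simple output value they map to."""
    groups = defaultdict(list)
    for element, simple_value in mapping.items():
        groups[simple_value].append(element)
    return dict(groups)


SIMPLE_TO_EVENTS = invert(EVENT_TO_SIMPLE)

# A simple output value may correspond to more than one Event type.
print(SIMPLE_TO_EVENTS["Judicial Event"])
print(SIMPLE_TO_EVENTS["IPO"])
```

A consumer that needs the exact Event type would likely have to use the full RDF output rather than the Simple Format.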
Information from Company and Geography disambiguation is integrated into the Simple Format output. Each resolved name is added as the value of the "normalized" attribute of the main entity (Company, City, ProvinceOrState, Country). For unresolved names, the "normalized" attribute and other information (such as geographical coordinates, ticker, etc.) are not shown in the Simple Format output.
Examples:
<City count="1" relevance="0.147" normalized="Golden,Colorado,United States">Golden</City>
<Company count="3" relevance="0.519" normalized="Google Inc.">Google</Company>
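As a rough illustration, the two example elements can be read with Python's standard `xml.etree.ElementTree` module. This is a minimal sketch, not part of the OpenCalais API: the helper name `display_name` is hypothetical, and falling back to the element text for unresolved names is an assumption based on the description above.

```python
import xml.etree.ElementTree as ET

# The two example elements from this section, copied verbatim.
EXAMPLES = [
    '<City count="1" relevance="0.147" normalized="Golden,Colorado,United States">Golden</City>',
    '<Company count="3" relevance="0.519" normalized="Google Inc.">Google</Company>',
]


def display_name(element):
    """Return the resolved name if present, else the name as found in the text.

    Hypothetical helper: unresolved entities carry no "normalized"
    attribute, so we fall back to the element text.
    """
    return element.get("normalized", element.text)


for xml_text in EXAMPLES:
    element = ET.fromstring(xml_text)
    # Attribute values are strings; convert them before doing arithmetic.
    count = int(element.get("count"))
    relevance = float(element.get("relevance"))
    print(element.tag, element.text, count, relevance, display_name(element))
```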
Results in the simple output format are returned in two forms:
- XML format: Entities and Events are given as an XML block. Each unique name is a separate XML node that also includes the count (frequency) of the name in the given input content.
- Semi-structured text format: Entities and Events are given in a semi-structured textual block (appears as an XML comment). The delimiting character is \t (tab).
Examples:
Text Input:
Tensleep Corporation (Other OTC:TENS.PK - News) ("Tensleep") announced that with the acquisition of XSTV Media, Inc. ("XSTV"),
it will become an online independent sports company. The transaction is
to close on or before September 15, 2007. Tensleep will, by the end of
this week or early next week, call a special meeting of shareholders to
approve the change name to "XSTV Corporation."
David Bailey, an analyst at Gerard Klauer Mattison, said such cuts "could include head count reductions."
Layoffs to some degree are inevitable, said IDC analyst Roger Kay. For years,
the company enjoyed a lower cost structure than other PC makers because
it sold computers directly.
International Star Inc. (OTC BB: ILST) announced that the annual meeting of
shareholders of International Star Inc. will be held on May 19, 2008,
at 3:00 p.m. (local time) at The Hilton Hotel, 104 Market Street,
Shreveport, La., 71101.
Simple Output Format:
<!--Use of the Calais Web Service is governed by the Terms of Service located at http://www.opencalais.com. By using this service or the results of the service you agree to these terms of service.-->
<!--
City: Shreveport,
Company: Tensleep Corporation,XSTV Corporation,International Star Inc.,Gerard Klauer Mattison,XSTV Media Inc.,IDC,
Event: M&A,General or Shareholder Meeting,
Facility: The Hilton Hotel,
IndustryTerm: sports,
Person: David Bailey,Roger Kay,
Position: analyst,
-->
<OpenCalaisSimple>
<Description>
<allowDistribution>true</allowDistribution>
<allowSearch>true</allowSearch>
<calaisRequestID>766b3bd5-97ca-2c44-121a-ab2691016c8a</calaisRequestID>
<externalID>QA</externalID>
<id>http://id.opencalais.com/ZwD-c-aYZMsFWaSXn2vWbw</id>
<about>http://d.opencalais.com/dochash-1/e12aabaf-1b01-3844-96bc-90238dae24dc</about>
<docTitle/>
<docDate>2007-09-15</docDate>
</Description>
<CalaisSimpleOutputFormat>
<Company count="4" relevance="0.445" normalized="Epic Corporation">Tensleep Corporation</Company>
<Company count="3" relevance="0.138" normalized="International Star, Inc.">International Star Inc.</Company>
<Company count="2" relevance="0.361" normalized="XSTV Media Inc">XSTV Media Inc.</Company>
<Event count="2">General or Shareholder Meeting</Event>
<Position count="2" relevance="0.329">analyst</Position>
<City count="1" relevance="0.111" normalized="Shreveport,Louisiana,United States">Shreveport</City>
<Company count="1" relevance="0.318">XSTV Corporation</Company>
<Company count="1" relevance="0.245" normalized="Harris Nesbitt Gerard, Inc">Gerard Klauer Mattison</Company>
<Company count="1" relevance="0.167" normalized="I.D.C. Holding a.s.">IDC</Company>
<Event count="1">M&amp;A</Event>
<Facility count="1" relevance="0.111">The Hilton Hotel</Facility>
<IndustryTerm count="1" relevance="0.361">sports</IndustryTerm>
<Person count="1" relevance="0.245">David Bailey</Person>
<Person count="1" relevance="0.167">Roger Kay</Person>
<Topics>
<Topic Taxonomy="Calais" Score="0.934">Business_Finance</Topic>
<Topic Taxonomy="Calais" Score="0.733">Technology_Internet</Topic>
</Topics>
</CalaisSimpleOutputFormat>
</OpenCalaisSimple>
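A response shaped like the example above could be consumed roughly as follows. This is a sketch under stated assumptions: it uses only Python's standard library, the function names `group_by_type` and `topics` are hypothetical, and `SAMPLE` is an abbreviated copy of the example output. Note that a standard XML parser accepts the "M&A" event value only when the ampersand is escaped as `&amp;`.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Abbreviated copy of the example output above. Note "M&amp;A": a literal
# "&" must be escaped for a standard XML parser to accept the document.
SAMPLE = """<OpenCalaisSimple>
  <Description>
    <docDate>2007-09-15</docDate>
  </Description>
  <CalaisSimpleOutputFormat>
    <Company count="4" relevance="0.445" normalized="Epic Corporation">Tensleep Corporation</Company>
    <Event count="2">General or Shareholder Meeting</Event>
    <City count="1" relevance="0.111" normalized="Shreveport,Louisiana,United States">Shreveport</City>
    <Company count="1" relevance="0.318">XSTV Corporation</Company>
    <Event count="1">M&amp;A</Event>
    <Topics>
      <Topic Taxonomy="Calais" Score="0.934">Business_Finance</Topic>
    </Topics>
  </CalaisSimpleOutputFormat>
</OpenCalaisSimple>"""


def group_by_type(xml_text):
    """Map each element name (Company, Event, ...) to a list of (value, count)."""
    root = ET.fromstring(xml_text)
    body = root.find("CalaisSimpleOutputFormat")
    groups = defaultdict(list)
    for element in body:
        if element.tag == "Topics":
            continue  # topics carry a Score, not a count; see topics() below
        groups[element.tag].append((element.text, int(element.get("count"))))
    return dict(groups)


def topics(xml_text):
    """Return (topic name, score) pairs from the Topics block."""
    root = ET.fromstring(xml_text)
    return [(t.text, float(t.get("Score"))) for t in root.iter("Topic")]


grouped = group_by_type(SAMPLE)
print(grouped["Company"])
print(grouped["Event"])
print(topics(SAMPLE))
```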
Simple Format Schema:
<!-- Comment-->
<OpenCalaisSimple>
<Description>
-- transaction details
</Description>
<CalaisSimpleOutputFormat>
-- list of resolved entities in the form:
<entity name count="occurrences count" relevance="score value" normalized="resolved name">entity value</entity name>
-- list of non-resolved entities in the form:
<entity name count="occurrences count" relevance="score value">entity value</entity name>
-- list of events in the form:
<Event count="occurrences count">event name</Event>
-- list of topics in the form:
<Topic Score="score value" Taxonomy="Calais">topic name</Topic>
</CalaisSimpleOutputFormat>
</OpenCalaisSimple>
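One possible way to model the schema in client code is a small record type in which the resolved name is optional. The sketch below is illustrative only: the `Entity` class and the `to_entity` helper are hypothetical names, and treating every element other than `Event` and `Topics` as an entity is an assumption drawn from the schema above.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import Optional


@dataclass
class Entity:
    type: str                  # element name, e.g. "Company"
    value: str                 # entity value as found in the input text
    count: int                 # occurrences count
    relevance: float           # score value
    normalized: Optional[str]  # resolved name; None for non-resolved entities


def to_entity(element):
    """Build an Entity from one child of CalaisSimpleOutputFormat.

    Returns None for Event and Topics elements, which the schema
    describes separately and which have no relevance attribute.
    """
    if element.tag in ("Event", "Topics"):
        return None
    return Entity(
        type=element.tag,
        value=element.text,
        count=int(element.get("count")),
        relevance=float(element.get("relevance")),
        normalized=element.get("normalized"),  # None when the attribute is absent
    )


resolved = to_entity(ET.fromstring(
    '<Company count="3" relevance="0.138" '
    'normalized="International Star, Inc.">International Star Inc.</Company>'
))
unresolved = to_entity(ET.fromstring(
    '<Facility count="1" relevance="0.111">The Hilton Hotel</Facility>'
))
print(resolved)
print(unresolved)
```

Keeping `normalized` optional mirrors the two entity forms in the schema: the attribute is simply absent for non-resolved entities.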
