Simple Format

Description

Calais now supports a simple output format that provides a snapshot of the entire RDF content and facilitates the parsing and integration of the results.

The simple output includes all Entities and a subset of Events and Facts. The following table summarizes the metadata elements included in the simple output format. Entities are shown before Events and Facts.

Metadata Element Simple Output Value

Metadata Element - Entities
Simple Output Value
Anniversary value of attribute Anniversary
City value of attribute City
Company value of attribute Company
Continent value of attribute Continent
Country value of attribute Country
Currency value of attribute Currency
EmailAddress value of attribute EmailAddress
EntertainmentAwardEvent value of attribute EntertainmentEvent
Facility value of attribute Facility
FaxNumber value of attribute FaxNumber
Holiday value of attribute Holiday
IndustryTerm value of attribute IndustryTerm
MarketIndex value of attribute MarketIndex
MedicalCondition value of attribute MedicalCondition
MedicalTreatment value of attribute MedicalTreatment
Movie value of attribute Movie
MusicAlbum value of attribute MusicAlbum
MusicGroup value of attribute MusicGroup
NaturalDisaster value of attribute NaturalDisaster
NaturalFeature value of attribute NaturalFeature
OperatingSystem value of attribute OperatingSystem
Organization value of attribute Organization
Person value of attribute Person
PhoneNumber value of attribute PhoneNumber
Product value of attribute Product
ProgrammingLanguage value of attribute ProgrammingLanguage
ProvinceOrState value of attribute ProvinceOrState
PublishedMedium value of attribute PublishedMedium
RadioProgram value of attribute RadioProgram
RadioStation value of attribute RadioStation
Region value of attribute Region
SportsEvent value of attribute SportsEvent
SportsGame value of attribute SportsGame
SportsLeague value of attribute SportsLeague
TVShow value of attribute TVShow
TVStation value of attribute TVStation
Technology value of attribute Technology
URL value of attribute URL
Metadata Element - Events
Simple Output Value
Acquisition "M&A"
Alliance "Business Partnership"
AnalystEarningsEstimate "Earnings Estimate"
AnalystRecommendation "Analyst Recommendation"
Bankruptcy "Bankruptcy"
BonusShares "Bonus Shares"
BusinessRelation "Business Partnership"
Buybacks "Security Buyback"
CompanyEarningsAnnouncement "Earnings Announcement"
CompanyEarningsGuidance "Earnings Guidance"
CompanyInvestment "Funding"
CompanyLegalIssues "Legal Issues"
CompanyMeeting "General or Shareholder Meeting"
CompanyReorganization "Reorganization"
ConferenceCall "Conference Call"
CreditRating "Credit Rating"
IPO "IPO"
JointVenture "Business Partnership"
ManagementChange "Management Change"
Merger "M&A"
MovieRelease "Movie Release"
MusicAlbumRelease "Music Album Release"
PersonTravel "Person Travel"
PersonCommunication "Person Communication and Meetings"
StockSplit "Stock Split"

Information from Company and Geography disambiguation is integrated in the Simple Format output. Each resolved name is added as a value of the "normalized" attribute of the main entity (Company, City, ProvinceOrState, Country). For unresolved names, the "normalized" attribute and other information (such as geographical coordinates, ticker, etc.) is not shown in Simple Format output.

Examples:

<City count="1" relevance="0.147"
  normalized="Golden,Colorado,United States">Golden>/City>
<Company count="3" relevance="0.519"
  normalized="Google Inc.">Google>/Company>

Results in the simple output format include two forms:

  • XML format: Entities and Events are given as an XML block. Each unique name is a separate XML node that also includes the count (frequency) of the name in the given input content.
  • Semi-structured text format: Entities and Events are given in a semi-structured textual block (appears as an XML comment). The delimiting character is \t (tab).

Examples:

Text Input:

Tensleep Corporation (Other OTC:TENS.PK - News) ("Tensleep") announced that with the acquisition of XSTV Media, Inc. ("XSTV"),
it will become an online independent sports company. The transaction is
to close on or before September 15, 2007. Tensleep will, by the end of
this week or early next week, call a special meeting of shareholders to
approve the change name to "XSTV Corporation."

David Bailey, an analyst at Gerard Klauer Mattison, said such cuts "could include head count reductions."
Layoffs to some degree are inevitable, said IDC analyst Roger Kay. For years,
the company enjoyed a lower cost structure than other PC makers because
it sold computers directly.
International Star Inc. (OTC BB: ILST) announced that the annual meeting of
shareholders of International Star Inc. will be held on May 19, 2008,
at 3:00 p.m. (local time) at The Hilton Hotel, 104 Market Street,
Shreveport, La., 71101.

Simple Output Format:

<!-- 
Use of the Calais Web Service is governed by the Terms of Service located at http://www.opencalais.com. By using this service or the results of the service you agree to these terms of service.
-->
<!--City: Shreveport,
Company: Tensleep Corporation, International Star Inc., XSTV Media Inc., XSTV Corporation, Gerard Klauer Mattison, IDC,
Event: General or Shareholder Meeting, M&A,
Facility: The Hilton Hotel,
IndustryTerm: sports,
Person: David Bailey, Roger Kay,
--><OpenCalaisSimple>
<Description>
<allowDistribution>true</allowDistribution>
<allowSearch>true</allowSearch>
<calaisRequestID>b5452ae4-1d6f-4012-a644-3662d347ccb6</calaisRequestID>
<externalID></externalID>
<id>http://id.opencalais.com/nhhvv9ffTGi*sD7Dk5j83Q</id>
<about>http://d.opencalais.com/dochash-1/c0805f22-d2d4-390e-ad40-42e903cd60cf</about>
</Description>
<CalaisSimpleOutputFormat>
<Company count="4" relevance="0.406" normalized="Epic Corporation">Tensleep Corporation</Company>
<Company count="3" relevance="0.123" normalized="International Star, Inc.">International Star Inc.</Company>
<Company count="2" relevance="0.344" normalized="XSTV Media Inc">XSTV Media Inc.</Company>
<Event count="2">General or Shareholder Meeting</Event>
<City count="1" relevance="0.089" normalized="Shreveport,Louisiana,United States">Shreveport</City>
<Company count="1" relevance="0.297">XSTV Corporation</Company>
<Company count="1" relevance="0.224" normalized="Harris Nesbitt Gerard, Inc">Gerard Klauer Mattison
<Company count="1" relevance="0.146" normalized="Investment Development Co">IDC</Company>
<Facility count="1" relevance="0.089">The Hilton Hotel</Facility>
<IndustryTerm count="1" relevance="0.327">sports</IndustryTerm>
<Person count="1" relevance="0.224">David Bailey</Person>
<Person count="1" relevance="0.146">Roger Kay</Person>
<Event count="1">M&A</Event>
<Topics>
<Topic Score="1.000" Taxonomy="Calais">Business_Finance</Topic>
</Topics>
 </CalaisSimpleOutputFormat>
</OpenCalaisSimple>

Simple Format Schema

<!-- Comment-->
<OpenCalaisSimple>

<Description>
-- transaction details

</Description>
<CalaisSimpleOutputFormat>
-- list of entities in the form:
<entity name count="occurrences count" "relevance=score value">entity value</entity name>

 --list of events in the form:
<Event count="occurrences count">even name</Event>

 --list of topics in the form:
<Topic Score="score value" Taxonomy="Calais">topic name</Topic>
 </CalaisSimpleOutputFormat>
</OpenCalaisSimple>

Back to Top