error 500 using semantic proxy : not sure if it's our side or not??
Posted on: Tue, 05/05/2009 - 13:42
Hello there, Alexis here, having fun using the Calais API! Just a question: we have several of these errors in our log:
SemanticProxy processing error: (500 - Internal Server Error) Exception while fetching page from : http://rss.slashdot.org/~r/Slashdot/slashdot/~3/LddK8tUOow4/article.pl
We are not sure if these errors actually come from your service or from our site. Any ideas on where to look?
Thanks a lot!

I'm getting the same error as well:
SemanticProxy processing error: Exception while fetching page from : http://...
P.S. I tried using SemanticProxy.com to test those URLs and they fail there as well. Here is an example of one of the URLs from my RSS feed that fail: http://www.azcentral.com/rsslinks/1153286
I also tried a URL from my own Drupal site, and it worked just fine... maybe this will give you an idea of what the problem might be.
Hi,
When you submit an HTTP URL to the SemanticProxy service, it first fetches the page from the server and then processes it with OpenCalais. When fetching pages we respect the remote server's access restrictions (commonly known as the robots.txt file). If a site does not allow HTTP access by our agent, we will not fetch the page.
The site that you are trying, http://www.azcentral.com, does not allow HTTP access by our agent. Ideally, in the returned RDF you should see the error message as below. If you do not see this, let us know and we will look into it.
<Exception>
Exception while fetching page from : http://www.azcentral.com/news/articles/2009/08/13/20090813poll0813.html
</Exception>
thanks.
sumit
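For anyone who wants to check this kind of restriction before submitting a URL, here is a minimal sketch using Python's standard urllib.robotparser. The agent name "ExampleProxyBot" and the robots.txt contents are made up for illustration; they are not SemanticProxy's real user agent or azcentral's real rules.

```python
from urllib.robotparser import RobotFileParser


def is_fetch_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    # parse() takes the robots.txt content as a list of lines,
    # so no network access is needed for this check.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt: one named agent is blocked, everyone else is allowed.
EXAMPLE_ROBOTS = """\
User-agent: ExampleProxyBot
Disallow: /

User-agent: *
Allow: /
"""

if __name__ == "__main__":
    page = "http://www.example.com/news/article.html"
    print(is_fetch_allowed(EXAMPLE_ROBOTS, "ExampleProxyBot", page))
    print(is_fetch_allowed(EXAMPLE_ROBOTS, "SomeOtherReader", page))
```

To test against a live site, RobotFileParser also offers set_url() and read() to download the real robots.txt, but you would still need to know the agent name the service actually sends.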
Oh, I see. They provide dozens of RSS feeds, but don't allow access to those pages. Interesting.
Thanks a lot for clearing this up for me.
I didn't look into the returned RDF messages. The Calais Drupal module only returned an error and didn't give me specific details (which would probably make sense to add).
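As a rough illustration of pulling that detail out of a response, the sketch below looks for an Exception element in the returned XML and returns its text. Only the Exception element itself comes from this thread; the wrapping "Error" element in the sample and the function name are assumptions, and the real response may be structured differently.

```python
import xml.etree.ElementTree as ET
from typing import Optional


def extract_exception(xml_text: str) -> Optional[str]:
    """Return the text of the first <Exception> element, or None if absent."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        # Not well-formed XML, so there is nothing we can extract.
        return None
    for elem in root.iter():
        # Drop any "{namespace}" prefix so the match works with or without one.
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name == "Exception":
            return (elem.text or "").strip()
    return None


# Hypothetical response body; the <Error> wrapper is an assumption.
SAMPLE = """<Error>
<Exception>
Exception while fetching page from : http://www.example.com/article.html
</Exception>
</Error>"""

if __name__ == "__main__":
    print(extract_exception(SAMPLE))
```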
Hi,
Just to confirm, those RSS feeds will be accessible by your browser or other RSS readers. But when a web agent (software component) tries to access the page, it's blocked.
We will make a note for our Drupal plugin to propagate the appropriate error message.
Thanks.
sumit
Hey there, it seems that lately all the nodes we are sending to SemanticProxy come back with this error 500. Are there any specific problems with SemanticProxy these days? Is there anything we can do to help? For example, submitting fewer nodes per hour, or something similar?
thanks
Alexis
Hi,
We are aware of some issues with our caching and are working to fix them.
In the meantime, I recommend resubmitting the request whenever you receive such an error.
Let us know if this works for you.
Thanks,
Ofer
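The resubmit advice above could be sketched roughly like this. The submit callable is a stand-in for whatever code actually sends a node to SemanticProxy and is assumed to return a (status, body) pair; the attempt count and the doubling delay are illustrative choices, not anything recommended by the Calais team.

```python
import time
from typing import Callable, Tuple


def submit_with_retry(
    submit: Callable[[], Tuple[int, str]],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str]:
    """Call submit() and resubmit on HTTP 500, waiting longer each time."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    status, body = submit()
    for attempt in range(1, max_attempts):
        if status != 500:
            break
        # Wait base_delay, then 2x, then 4x, ... before each resubmission.
        sleep(base_delay * 2 ** (attempt - 1))
        status, body = submit()
    # Either a non-500 response or the last 500 after all attempts.
    return status, body


if __name__ == "__main__":
    # Fake service that fails twice and then succeeds (illustration only).
    responses = iter([(500, "error"), (500, "error"), (200, "<rdf/>")])
    print(submit_with_retry(lambda: next(responses), sleep=lambda s: None))
```

Note that retrying only helps with transient failures such as the caching issue; a page blocked by robots.txt will fail the same way on every attempt, so it is worth checking the error text before resubmitting.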