Monday, January 2, 2017

SEO Rankings are based on content, which is primarily the text on the webpages

What looks like text - may not be readable by the Google Spider.
Since content determines SEO Rankings for a website, unreadable content will kill any chance for rankings.
There is text and then there is picture of text - do you know the difference?
Text readable to humans and to the Google Spider is the kind of content that is the most desirable for communicating to your visitors, as well as enabling the Spider process the website contents and determine how to Index and Rank the website.
A picture of text visible in a PDF document, may well be readable to humans, yet sometimes not be "readable" by the Spider.
So whats the difference?
That depends on how the PDF was created.
If the PDF document was created directly from a Text or Word document, the text is actually included within the PDF itself, which you can see if you "Export" the PDF to text.
However if the PDF was created from a "Screen-Capture" of text or created from a Photoshop image, the text itself is not included within the PDF and therefore the Spider may not be able to "read" and index it for the content.
Google has noted that it is working on Optical Character Recognition (OCR) of PDF documents, but that is not a certainty and OCR has never proven to be anywhere near 100% accurate.
In addition if the page content is purely text derived from within a PDF, it will not have any of the HTML characteristics that can be assigned and are used by the Spider to determine significance and importance, like header H-Tags, Paragraph headings, etc.
So beware, its far better to have fully HTML formatted text for your webpages, rather that a PDF image of the page text in order to make sure your website content is fully indexed in order to get it ranked.
"just saying..."

No comments:

Post a Comment