Webpage Language Attribute
I’ve been wanting to flex my computing skills by building a sort of search engine, and I’ve found some really helpful material along the way. I like the idea of sitemaps and “robots.txt”; if anyone has pointers to other SEO-related conventions, I would be delighted!
My question is how to detect a webpage’s primary language. Most pages seem to just put lang="en" on their html element, even when the page is not in English. Is there other HTTP header or <head> information that I can parse to detect the language? I don’t want to parse the complete page, reading every character and checking whether it’s an English character or not.
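To make the question concrete, here is a rough sketch of the head-only approach I have in mind. It gathers the language hints I know of (the lang attribute on the html element, a meta http-equiv="content-language" tag, the Open Graph og:locale meta tag, and the HTTP Content-Language response header) and stops reading at the end of the head. The names LangHintParser and collect_language_hints are made up for illustration, and it assumes the page has already been fetched and decoded into a string. It only reports what the page declares; it does not decide which hint to trust.

```python
from html.parser import HTMLParser


class _HeadDone(Exception):
    """Raised to stop parsing once the end of <head> is reached."""


class LangHintParser(HTMLParser):
    """Hypothetical helper: collects declared-language hints from <head>."""

    def __init__(self):
        super().__init__()
        self.hints = {}

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names; values are kept as-is.
        a = {name: (value or "") for name, value in attrs}
        if tag == "html" and a.get("lang"):
            self.hints["html_lang"] = a["lang"]
        elif tag == "meta":
            content = a.get("content", "")
            if a.get("http-equiv", "").lower() == "content-language" and content:
                self.hints["meta_content_language"] = content
            elif a.get("property", "").lower() == "og:locale" and content:
                self.hints["og_locale"] = content
        elif tag == "body":
            # Some pages omit </head>; stop as soon as the body starts.
            raise _HeadDone

    def handle_endtag(self, tag):
        if tag == "head":
            raise _HeadDone


def collect_language_hints(html_text, http_content_language=None):
    """Return a dict of declared-language hints without reading the body."""
    parser = LangHintParser()
    try:
        parser.feed(html_text)
    except _HeadDone:
        pass
    hints = dict(parser.hints)
    if http_content_language:
        # Value of the Content-Language response header, if the server sent one.
        hints["http_content_language"] = http_content_language
    return hints
```

Calling collect_language_hints(page_text, response_header_value) returns a dictionary such as one with the keys html_lang and og_locale. If the hints disagree with each other, that by itself might be a sign that the lang attribute was copied from a template, though I don’t know how reliable that signal is in practice.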