Search Engine Optimization
Google Search Engine Optimzation
http://infolab.stanford.edu/~backrub/google.html1 submit domain and page URL
Improve key phrase matching
- Put key words in title, meta tags, anchor, URL, and as captalized words, with large, bold fonts as close to beginning of body as possible
- Put as many as possible related key words in above-cited way
- Consider if possible to use text on the page which is not directly represented to the user
Improve page ranking
- Create as many as possible links in high-quality sites that point to your pages
- In the anchor text of the links created above, describe the page with the key words as described in "Improve key phrase matching".
- Submit domain and page URL to search engines
- How to add google search to your site https://www.google.com/cse/?hl=en
- Create sitemap XML and keep it updated
- Ensure your title tags and alt attributes of your image tags have the key words you want
- Create external links, with descriptive anchors, from high-ranking pages to your pages
- With the robots.txt file, site owners can choose not to be crawled, indexed, cached, etc. by Googlebot or other crawlers.
- When creating pages, pay attention to terms used in a page, freshness of the page, region
Google search algorithms
- External meta information: reputation of the source, update frequency, quality, popularity or usage, and citations
- Google has location information for all hits and so it makes extensive use of proximity in search
- Google keeps track of some visual presentation details such as font size of words. Words in a larger or bolder font are weighted higher than other words
- 12. external meta information are information that can be inferred about a document, but is not contained within it. Examples of external meta information
- Include things like reputation of the source, update frequency, quality, popularity or usage, and citations.
- Also, it is interesting to note that metadata efforts have largely failed with web search engines, because any text on the page which is not directly represented to the user is abused to manipulate search engines.
- A hit list corresponds to a list of occurrences of a particular word in a particular document including position, font, and capitalization information.
- There are two types of hits: fancy hits and plain hits. Fancy hits include hits occurring in a URL, title, anchor text, or meta tag. Plain hits include everything else.
- A plain hit consists of a capitalization bit, font size, and 12 bits of word position in a document (all positions higher than 4095 are labeled 4096).
- We use font size relative to the rest of the document because when searching, you do not want to rank otherwise identical documents differently just because one of the documents is in a larger font.
- Every hitlist includes position, font, and capitalization information. Additionally, we factor in hits from anchor text and the PageRank of the document.
- Scan through the doclists until there is a document that matches all the search terms.
- Sort the documents that have matched by rank and return the top k.