2016-12-25 - Though these notes were researched when Compiled Website was live, they are universal.
See also:
-
After various validations pass, and I hand-check things according to anybrowser, add the following into the footer:
<a href="https://anybrowser.org/campaign/">Viewable With Your Favorite Browser</a>
- Later, write an essay on this topic and link to it from the footer, and instead link to anybrowser.org from the essay.
HTML ∞
HTML Validation ∞
w3.org ∞
<a href="https://validator.w3.org/check?uri=referer"> <img src="i/valid-xhtml10.png" alt="Valid XHTML 1.0 Transitional" height="31" width="88" /> </a>
CSS validation ∞
Link checking ∞
- w3
I don't understand why their checker doesn't allow checking of its own links, probably for traffic reasons. I have no robots.txt
disallow for them.
JavaScript checking ∞
See JavaScript for the list of features used with it.
My knowledge of JavaScript is sorely lacking. There are a few things I've copied from elsewhere which are pretty critical which are not standards-compliant. I just don't know enough to fix or replace what I'm doing..
A HREF function-links ∞
JavaScript function-links are not actually valid. I've tried all sorts of stuff, but the best I can do is to wrap such things inside of <script type="text/javascript">
so that it'll only appear when JavaScript is enabled. Example link:
<a href="javascript:toggle('styles')">
To fix the validation issue, I started doing:
<a accesskey="t" href="/javascript.html#s0" onClick="javascript:toggle('styles');return false">Styles</a>
.. this forces me to have the link though. While this isn't really what I wanted, it does give an opportunity to link to another page explaining what JavaScript would have allowed the user to do.
Using HTML with document.write
∞
JavaScript document.write
technically shouldn't have HTML opening tags within it. HTML Tidy will escape any forward slashes in opening tags, ruining the code. I've never found a way around this. If I put only ending HTML tags inside the JavaScript - which is valid - and the other text outside, I still get validation errors. I have found no way around this.
Example code:
<script type="text/javascript"><!-- var heredoc = (<r><![CDATA[ <p>some example text</p> ]]></r>).toString(); document.write(heredoc); //--></script> <noscript> </noscript>
robots.txt
checkers ∞
-
- https://web.archive.org/web/20201023165946/http://tool.motoricerca.info/robots-checker.phtml
- https://web.archive.org/web/20130805085933/http://www.invision-graphics.com/robotstxt_validator.html
- https://web.archive.org/web/20120709084554/http://www.targetable.com:80/scripts/robotstxt.html
-
https://web.archive.org/web/20201023165946/http://tool.motoricerca.info/robots-checker.phtml
- Specific to hosts
-
- See answer 156449
- .. check your own host to see if they have tools.
Notes on specific robots ∞
- https://archive.org/
- info: https://web.archive.org/web/20200424140256/https://support.alexa.com/hc/en-us/articles/200450194
-
User-agent: ia_archiver
- https://www.google.com/, https://www.google.com/imghp, https://www.google.com/videohp, etc: https://about.google/intl/en/products/
- https://search.aol.com/, https://search.aol.com/aol/imagehome, https://www.aol.com/video/
- info: Googlebot
- I can't find direct information, try around here: 8456
- submit: https://developers.google.com/search/docs/crawling-indexing/ask-google-to-recrawl [ 1 ] [ 2 ]
-
User-agent: Googlebot
Microsoft / Yahoo [ 3 ] :
- www.bing.com / live.com
-
- Yahoo! Directory was retired.
-
info: Msnbot
- I can't even give a direct URL to official information on it.
- submit: https://www.bing.com/webmasters/about [ 5 ] [ 6 ]
-
User-agent: msnbot
[ 7 ]
Sitemap ∞
In robots.txt
, add something like:
Sitemap: http://example.com/sitemap.xml
Multiple lines for multiple sitemaps is allowed.
RSS ∞
(RSS)
(Not implemented yet)
Server ∞
Since this engine doesn't really care about the functionality of the server, I don't have much to say about it.
Misc. notes:
- Custom error documents
- Logging and statistics
-
Security settings
Email ∞
I'm not using self-hosted email right now.
Last updated 2022-12-24 at 06:24:02
Footnotes
- was https://developers.google.com/search/docs/advanced/crawling/ask-google-to-recrawl [ ↩ ]
- was https://www.google.com/webmasters/tools/submit-url [ ↩ ]
- They also absorbed Altavista [ ↩ ]
- Yahoo switched to Microsoft's bing.com engine: __ [ ↩ ]
- was https://www.bing.com/webmasters/homepage [ ↩ ]
- was https://www.bing.com/toolbox/webmaster/ [ ↩ ]
- Yahoo uses or used
User-agent: Slurp
[ ↩ ] - was https://support.google.com/webmasters/answer/156184 [ ↩ ]