Settings

Theme

NoML Proposal: searchable on search engines, but not used for ML

noml.info

1 points by ErrrNoMate 2 years ago · 1 comment

Reader

ErrrNoMateOP 2 years ago

A specification for those who want content searchable on search engines, but not used for machine learning.

Publishers need improved ways to indicate how they want content to be used in search and machine learning. Using robots.txt does not cover all use cases, and so a complementary approach is needed as proposed here. It is one which can be applied to individual webpages as desired, and can be preserved as such in datasets of web content.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection