mlscraper
Mirror of https://github.com/lorey/mlscraper
matthias
Age | Commit message | Author
2020-09-25 | Fix duplicated training runs for attributes in simple scraper | Karl Lorey
2020-09-25 | Introduce new object model with rule-based scraper | Karl Lorey
2020-09-24 | Improve speed by sampling and limiting generated css paths | Karl Lorey
2020-09-24 | Turn generator to list for easier debugging when testing generators | Karl Lorey
2020-09-23 | Renaming again to avoid name collision :( | Karl Lorey
2020-08-01 | Extend README with picture and installation instructions | Karl Lorey
2020-07-31 | Add example for quotes.toscrape.com | Karl Lorey
2020-07-31 | Add setup.py | Karl Lorey
2020-07-31 | Ignore whitespace when extracting text of tags | Karl Lorey
2020-07-31 | Renaming to autoscraper to avoid name collision | Karl Lorey
2020-07-31 | Use css selectors for classification | Karl Lorey
2020-07-31 | Add generators for css selectors based on a path of nodes | Karl Lorey
2020-07-30 | Add ML-based scraping for one item per page | Karl Lorey
2020-07-30 | Change build() method signature to have items first | Karl Lorey
2020-07-30 | Introduce autoscrape.util module | Karl Lorey
2020-07-30 | Improve README wording | Karl Lorey
2020-07-30 | Turn module into package | Karl Lorey
2020-06-10 | Add ML-based prototype | Karl Lorey
2020-06-10 | Initial | Karl Lorey