mlscraper
Mirror of https://github.com/lorey/mlscraper
matthias
path: root/tests
Age | Commit message | Author
2022-08-18 | Fix testing examples by only running them in specialized environment | Karl Lorey
2022-08-18 | Minor: fix formatting | Karl Lorey
2022-08-18 | Test code in example folder (#28) | Leonardo Tarla
2022-07-07 | Add test for nbsp issue #15 | Karl Lorey
2022-07-06 | Move functionality to html module and fix minor errors with selector generation | Karl Lorey
2022-07-06 | Minor performance improvements | Karl Lorey
2022-07-06 | Generate selectors faster by leveraging recursion and caching | Karl Lorey
2022-07-05 | Minor fixes and improvements | Karl Lorey
2022-06-24 | Add tests for github profiles | Karl Lorey
2022-06-24 | Avoid matching numbers inside image dimensions | Karl Lorey
2022-06-24 | Add nth-child selector generation | Karl Lorey
2022-06-24 | Also match all parents that contain the same text | Karl Lorey
2022-06-23 | Add child selectors for CSS generation | Karl Lorey
2022-06-23 | Improve performance by fixing hashing and root computation | Karl Lorey
2022-06-23 | Add attribute-based CSS selectors | Karl Lorey
2022-06-23 | Fix selection of arbitrary text within nodes (for now) | Karl Lorey
2022-06-21 | Ignore whitespace around values when searching for matches in HTML | Karl Lorey
2022-06-20 | Apply python 3.9+ features | Karl Lorey
2022-06-20 | Re-implement selector generation with a speedup >10x [develop] | Karl Lorey
2022-06-19 | Speed up training by filtering overlapping matches and preferring deep matches | Karl Lorey
2022-06-19 | Fix css selector generation by adding tag name and avoiding empty selector | Karl Lorey
2022-06-17 | Fix ListScraper and introduce maximum complexity parameter | Karl Lorey
2022-06-15 | Loop through all possible selectors when training ListScraper | Karl Lorey
2022-06-15 | Rewrite training module to decrease complexity | Karl Lorey
2022-06-15 | Implement a factory for each page to solve identity and equality issues for now | Karl Lorey
2022-06-14 | I might go insane with this one | Karl Lorey
2022-06-14 | Use stackoverflow fixture throughout all tests | Karl Lorey
2022-06-14 | Read stackoverflow sample with rb to get actual bytes | Karl Lorey
2022-06-14 | Add missing conftest.py | Karl Lorey
2022-06-14 | Pull stackoverflow test sample to module level | Karl Lorey
2022-06-13 | Adapt python versions to 3.9+ | Karl Lorey
2022-06-13 | Apply black and other styling | Karl Lorey
2022-06-13 | Import mlscraper-experiments | Karl Lorey
2020-09-27 | Prepare python package with cookiecutter template | Karl Lorey