| Age | Commit message | Author |
| --- | --- | --- |
| 2022-06-20 | Re-implement selector generation with a speedup >10x [develop] | Karl Lorey |
| 2022-06-19 | Speed up training by filtering overlapping matches and preferring deep matches | Karl Lorey |
| 2022-06-19 | Fix css selector generation by adding tag name and avoiding empty selector | Karl Lorey |
| 2022-06-17 | Fix ListScraper and introduce maximum complexity parameter | Karl Lorey |
| 2022-06-15 | Revert previous commit and add comment why | Karl Lorey |
| 2022-06-15 | Loop through all possible selectors when training ListScraper | Karl Lorey |
| 2022-06-15 | Rewrite training module to decrease complexity | Karl Lorey |
| 2022-06-15 | Implement a factory for each page to solve identity and equality issues for now | Karl Lorey |
| 2022-06-14 | I might go insane with this one | Karl Lorey |
| 2022-06-14 | Use stackoverflow fixture throughout all tests | Karl Lorey |
| 2022-06-14 | Read stackoverflow sample with rb to get actual bytes | Karl Lorey |
| 2022-06-14 | Add missing conftest.py | Karl Lorey |
| 2022-06-14 | Pull stackoverflow test sample to module level | Karl Lorey |
| 2022-06-13 | Try to install lmxl dependendencies during CI | Karl Lorey |
| 2022-06-13 | Adapt python versions to 3.9+ | Karl Lorey |
| 2022-06-13 | Swtich from travis to Github Actions, pre-commit, and tox | Karl Lorey |
| 2022-06-13 | Apply black and other styling | Karl Lorey |
| 2022-06-13 | Import mlscraper-experiments | Karl Lorey |
| 2020-10-03 | Circumvent aggressive caching to display updated badges | Karl Lorey |
| 2020-09-29 | I have to start testing example code... | Karl Lorey |
| 2020-09-28 | Use a wide image in readme to save space | Karl Lorey |
| 2020-09-28 | Fix readme code sample | Karl Lorey |
| 2020-09-28 | Update readme with rule-based scraper as this is easier to get going | Karl Lorey |
| 2020-09-28 | Update readme with improved description | Karl Lorey |
| 2020-09-27 | Update examples with stackoverflow scraper | Karl Lorey |
| 2020-09-27 | Add upcoming headline to HISTORY.rst and add bump2version rule | Karl Lorey |
| 2020-09-27 | Update README with badges and installation instructions | Karl Lorey |
| 2020-09-27 | Bump version: 0.1.1 → 0.1.2 [v0.1.2] | Karl Lorey |
| 2020-09-27 | Fix history formatting to prevent pypi rejection | Karl Lorey |
| 2020-09-27 | Bump version: 0.1.0 → 0.1.1 [v0.1.1] | Karl Lorey |
| 2020-09-27 | Fix username for token authentication in travis config | Karl Lorey |
| 2020-09-27 | Bump version: 0.0.0 → 0.1.0 [v0.1.0] | Karl Lorey |
| 2020-09-27 | Adapt bumpversion to black formatting | Karl Lorey |
| 2020-09-27 | Set verion to 0.0.0 to use bumpversion | Karl Lorey |
| 2020-09-27 | Add travis badge to readme | Karl Lorey |
| 2020-09-27 | Fix missing lxml requirement | Karl Lorey |
| 2020-09-27 | Add PyPI credentials | Karl Lorey |
| 2020-09-27 | Prepare python package with cookiecutter template | Karl Lorey |
| 2020-09-26 | Increase log level for make-based test runs | Karl Lorey |
| 2020-09-26 | Add test for id-based selectors | Karl Lorey |
| 2020-09-26 | Use heuristic to prefer simple rules in single-item scraper | Karl Lorey |
| 2020-09-26 | Skip unstable test | Karl Lorey |
| 2020-09-26 | Fix quote_to_scrape example | Karl Lorey |
| 2020-09-26 | Simplify interfaces even more (plain html only) | Karl Lorey |
| 2020-09-26 | Update README with new interfaces | Karl Lorey |
| 2020-09-26 | Unify interfaces | Karl Lorey |
| 2020-09-26 | Generate id and child-based selectors | Karl Lorey |
| 2020-09-26 | Make rule-based selection faster and more robust | Karl Lorey |
| 2020-09-26 | Add test set generation for single item pages | Karl Lorey |
| 2020-09-25 | Improved basic scraper to be fully functional | Karl Lorey |