diff options
author | Sam Tay <sam.chong.tay@gmail.com> | 2020-06-23 21:09:35 -0700 |
---|---|---|
committer | Sam Tay <sam.chong.tay@gmail.com> | 2020-06-23 21:09:35 -0700 |
commit | 74bda95681c253eea5010417dc74e8569010f7f9 (patch) | |
tree | dbb7168aab62e75c7ccbfcf3e109b5ac6dafe902 /TODO.md | |
parent | 0c4bafb3eb996b0e70707a32c11e8a1a2f9572ba (diff) |
Maintain order of duckduckgo search resultsv0.3.0
Diffstat (limited to 'TODO.md')
-rw-r--r-- | TODO.md | 22 |
1 files changed, 7 insertions, 15 deletions
@@ -1,8 +1,5 @@ # TODO -### v0.3.0 -1. Keep relevance ordering !!! - ### v0.3.1 1. Much of the code can be reused for google: * parsing href after `"url="` (similar to uddg) @@ -15,9 +12,6 @@ - api - scraper - - - ### Endless future improvements for the TUI 1. Init with smaller layout depending on initial screen size. 2. Maybe cli `--auto-resize` option. @@ -32,17 +26,15 @@ ### resources for later #### scraping -6. Google stuff [scraping with reqwest](https://rust-lang-nursery.github.io/rust-cookbook/web/scraping.html)) - ```python -# if necessary, choose one of these to mimic browswer request +# if necessary, choose one of these to mimic browser request USER_AGENTS = ('Mozilla/5.0 (Macintosh; Intel Mac OS X 10.7; rv:11.0) Gecko/20100101 Firefox/11.0', - 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:22.0) Gecko/20100 101 Firefox/22.0', - 'Mozilla/5.0 (Windows NT 6.1; rv:11.0) Gecko/20100101 Firefox/11.0', - ('Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_4) AppleWebKit/536.5 (KHTML, like Gecko) ' - 'Chrome/19.0.1084.46 Safari/536.5'), - ('Mozilla/5.0 (Windows; Windows NT 6.1) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.46' - 'Safari/536.5'), ) +'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:22.0) Gecko/20100 101 Firefox/22.0', +'Mozilla/5.0 (Windows NT 6.1; rv:11.0) Gecko/20100101 Firefox/11.0', +('Mozilla/5.0 (Macintosh; Intel Mac OS X 10_7_4) AppleWebKit/536.5 (KHTML, like Gecko) ' +'Chrome/19.0.1084.46 Safari/536.5'), +('Mozilla/5.0 (Windows; Windows NT 6.1) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.46' +'Safari/536.5'), ) # checks for search engine blocks BLOCK_INDICATORS = ( |