59 Commits

Author SHA1 Message Date
a23429104c dead code removal 2025-03-21 06:03:34 +00:00
66581cc453 getting there 2025-03-21 05:59:40 +00:00
7df19a480f updates 2025-03-20 15:11:01 -06:00
b9c1f0b492 readme updates 2025-03-19 15:05:32 -06:00
71b7b2d7bc it works and it is awesome 2025-03-19 15:04:00 -06:00
bac3cd9d1d add most recent long run 2025-03-19 15:03:49 -06:00
1f6a0acce3 shutup spellchecker 2025-03-19 15:03:39 -06:00
53dbf53ab9 newest settings 2025-03-19 15:03:24 -06:00
0477bb26e4 viz improvements 2025-03-19 15:03:11 -06:00
6409baaffb Reducted trips to surreal by x500 2025-03-19 12:41:08 -06:00
135a7e4957 Merge pull request 'multithreading' (#2) from multithreading into main
Reviewed-on: #2
2025-03-19 05:00:59 +00:00
9aa34b3eee epic metrics 2025-03-19 04:59:50 +00:00
de80418c00 better logging 2025-03-18 16:09:46 -06:00
e3e4175f51 logging improvements 2025-03-18 15:25:56 -06:00
d11e7dd27c the biggest 1 line improvement ever 2025-03-18 15:25:40 -06:00
f2a3e836a0 spelling and clippy 2025-03-18 15:08:29 -06:00
3b4e6a40ce minimize vec resizing 2025-03-18 15:07:50 -06:00
bd0b946245 fixed tracing 2025-03-18 15:02:32 -06:00
b7540a4680 checkpoint - onto profiling 2025-03-18 10:53:06 -06:00
Oliver Atkinson
82929fd0fc updating for base64 2024-12-13 13:28:24 -07:00
Oliver Atkinson
f42e770a10 moved to other repo 2024-12-13 11:01:35 -07:00
Oliver Atkinson
611a1e923b starting on the extension 2024-12-12 15:32:04 -07:00
Oliver Atkinson
298ad39a79 rename 2024-12-12 14:59:54 -07:00
Oliver Atkinson
215056e493 use contains operator for better output 2024-12-12 14:26:49 -07:00
Oliver Atkinson
22be3b2f61 updating deps 2024-12-12 14:14:38 -07:00
Oliver Atkinson
c1c8cf07bb unifed settings for testing 2024-12-12 11:42:07 -07:00
0f8a3d7215 using a custom parser now :) 2024-11-12 23:08:09 -07:00
574a370f30 readme updates 2024-11-12 21:24:57 -07:00
eaa79b749e prepare get function for s3 2024-11-12 21:19:05 -07:00
2c28d69d55 add s3 support 2024-11-12 21:03:58 -07:00
d28d18de08 formatting and timer changes, consolidated functions 2024-11-12 18:40:10 -07:00
8a5ac61b26 Merge pull request 'custom_engine' (#1) from custom_engine into main
Reviewed-on: #1
2024-11-13 00:53:31 +00:00
d9d4c56142 fits with compose.yml 2024-11-12 17:51:13 -07:00
f87a43c3a9 added demo site for testing elements 2024-11-12 17:50:59 -07:00
7cac880f8e remove un-used function 2024-11-12 17:50:28 -07:00
720adaa552 added support for nearly all html tags that can have a link 2024-11-12 17:50:06 -07:00
7c32600694 this is really all that's needed 2024-11-12 17:49:45 -07:00
399510c599 use reqwest client for epic speedup 2024-11-10 20:37:00 -07:00
ec66c4e765 remove unused import 2024-11-10 20:36:39 -07:00
a9628ee5e4 working, now onto speeding it up 2024-11-10 20:24:04 -07:00
5404d5c3e8 it works :party: 2024-11-09 23:30:57 -07:00
fd971bafbf it works now 2024-11-09 15:28:10 -07:00
c3997b0bb7 works more, but still not all the way 2024-11-09 11:30:32 -07:00
Oliver Atkinson
7826c4cec6 jank-ish fix but it sure does work
make the root record (for links https://example.com/) have a record id of the url, thus preventing duplication when using upsert
2024-10-31 15:32:37 -06:00
Oliver Atkinson
3a46dd937b updates 2024-10-31 15:09:48 -06:00
Oliver Atkinson
fbca067b1f clean up walk() 2024-10-31 14:10:14 -06:00
Oliver Atkinson
9324160e74 crawling 🕷️ 2024-10-07 11:14:56 -06:00
Oliver Atkinson
974bccc457 no longer using spider, just wiritng my own crawler 2024-10-04 13:52:34 -06:00
2d2b09116e lockfile has update 🤷 2024-08-26 23:15:06 -06:00
38fe7d3a59 add useage 2024-08-26 01:14:10 -06:00