#soylent | Logs for 2024-11-26
[20:57:16] <janrinok> I'm ending my day - cu tomorrow guys
[20:27:59] -!- Runaway1956 [Runaway1956!~OldGuy@the.abyss.stares.back] has joined #soylent
[20:27:38] -!- Runaway1956 has quit [Read error: -0x7880: SSL - The peer notified us that the connection is going to be closed]
[19:02:41] <Fnord666> Thanks!
[17:04:41] <systemd> ^ Show HN: Extract Markdown, HTML or text from content-heavy websites
[17:04:41] <systemd> ^ Content Parser Website
[17:04:40] <fab23> out of my notes, also https://content-parser.com (through https://news.ycombinator.com with hints in comments for some other tools as well)
[17:04:00] <Fnord666> Gotcha
[17:03:47] <fab23> Fnord666: not specifically, just tools to "extract" content from webpages
[17:02:38] <Fnord666> Are you talking about tools like Selenium?
[17:02:37] <systemd> ^ GitHub - orf/html-query: jq, but for HTML ( https://github.com )
[17:02:36] <systemd> ^ GitHub - mgdm/htmlq: Like jq, but for HTML.
[17:02:35] <fab23> janrinok: other maybe useful things https://github.com or https://github.com
[17:01:45] <fab23> janrinok: I just checked my notes, you are aware that browsers (Firefox, Chrome) could also be used in headless mode with some command line options?
[16:50:26] <kolie> I'll pull that list in a minute.
[16:44:05] <janrinok> It can already take a URL and in most cases present it in a format that can be directly submitted - even automatically if you wish. It can still have problems with some sites that are now obfuscating their content to prevent scraping.
[16:42:27] <Fnord666> janrinok, I just saw that you were asking about templating and wondered what that was in relation to. I thought maybe it was Arthur
[16:40:58] <janrinok> Fnord666, how do you want submissions templated? It is possible that Arthur might already do some of what you would like. I can perhaps use it as a starting point if you can explain exactly what you want it to do.
[16:36:57] <Fnord666> yes
[16:33:33] <kolie> You were trying to get a list of feeds regurgitator knows about?
[16:31:53] <Fnord666> Good morning
[16:31:34] <kolie> What's up guys
[13:46:39] -!- madcow has quit [Ping timeout: 272 seconds]
[13:42:48] -!- madcow_ [madcow_!~Madcow@120.19.rxi.xp] has joined #soylent
[12:32:15] -!- jje [jje!jje@ddjffrbpjin.info] has joined #soylent
[12:23:11] -!- mode/#soylent [+v fliptop] by Imogen
[12:23:11] -!- fliptop [fliptop!~fliptop@Soylent/Staff/Sysop/fliptop] has joined #soylent
[12:23:11] -!- fliptop has quit [Changing host]
[12:22:47] -!- fliptop [fliptop!~fliptop@69.43.kn.gu] has joined #soylent
[11:37:27] -!- jje has quit [Remote host closed the connection]
[04:47:41] -!- madcow_ has quit [Ping timeout: 272 seconds]
[04:44:18] -!- madcow [madcow!~Madcow@101.119.pp.vml] has joined #soylent
[02:50:42] -!- inz [inz!~inz@wbi.fi] has joined #soylent
[02:34:41] -!- inz has quit [Ping timeout: 272 seconds]
[02:34:41] -!- alexbst has quit [Ping timeout: 272 seconds]
[00:43:00] -!- systemd [systemd!~systemd@pid1] has joined #soylent