| Line | Revision | Contents |
| 1 | 1 | 20.11.98 |
| 2 | configuration design is hopefully finished; a few things are still | |
| 3 | unclear, but I am already starting to code :) | |
| 4 | 23.11.98 | |
| 5 | cacheroot is now also read from scache.cnf | |
| 6 | line numbering in the configuration file, with line numbers shown | |
| 7 | in error messages | |
| 8 | location object redesigned | |
| 9 | 26.11.98 | |
| 10 | parse() in engine class options done | |
| 11 | the program parses much more intelligently than I expected! At the end | |
| 12 | for a reason unknown to me it ignores. And they say it is not possible to create | |
| 13 | a work that surpasses its creator. | |
| 14 | 27.11.98 | |
| 15 | cobbled together the priority options sorter. | |
| 16 | 30.11.98 | |
| 17 | fixed priority sorter (adding separators) | |
| 18 | 13.12.98 | |
| 19 | implemented options.addDefaults() | |
| 20 | 18.12.98 | |
| 21 | parser for location.serveroptions done | |
| 22 | started work on masks | |
| 23 | 19.12.98 | |
| 24 | mask parser | |
| 25 | 20.12.98 | |
| 26 | mask parser finished (PHEW) | |
| 27 | target guesser | |
| 28 | strip guesser | |
| 29 | configloader done, so we can begin the real work! | |
| 30 | source 45k | |
| 31 | 22.12.98 | |
| 32 | cobbled-together priority queue more or less working | |
| 33 | pq fixed and extended with automatic growth | |
| 34 | finished pop() function - removing items from pq | |
| 35 | 24.12.98 | |
| 36 | pq.search() | |
| 37 | request.java defined | |
| 38 | 25.12.98 | |
| 39 | thread managing | |
| 40 | 26.12.98 | |
| 41 | work on the actual request processing, hacked up an html parser | |
| 42 | 27.12.98 | |
| 43 | it now parses html; localstore fixed (made usable) | |
| 44 | 28.12.98 | |
| 45 | splitter of dir+file in url into dir (util) | |
| 46 | target detector (mask) | |
| 47 | fixed default actions in location | |
| 48 | it more or less browses now | |
| 49 | io buffers - a necessity, otherwise slow... | |
| 50 | 29.12.98 | |
| 51 | it more or less works now... but it is S.L.O.W. | |
| 52 | ||
| 53 | 31.12.98 | |
| 54 | inject sites now implemented | |
| 55 | fixed bug in regexp upper/lower case | |
| 56 | ||
| 57 | 2.01.99 | |
| 58 | hacks to improve command-line arguments | |
| 59 | - configure as sublocation | |
| 60 | - | |
| 61 | dns aliases are case-insensitive | |
| 62 | faster dns unaliasing | |
| 63 | support for port number in starturl | |
| 64 | 4.01.99 | |
| 65 | user-agent mozilla/2 is now added | |
| 66 | 85k source | |
| 67 | util.getext returns Query on ? = ; | |
| 68 | .cfg keyword AddActions (does not change defaultmask) | |
| 69 | 11.01.99 | |
| 70 | Improved command-line option parser, added some options | |
| 71 | support for CSS html tag | |
| 72 | 15.01.99 | |
| 73 | added support for user-configurable maxthreads, retry and retrypriority | |
| 74 | fixed configloader | |
| 75 | 16.01.99 | |
| 76 | reworked depthset - depth can *always* be lowered | |
| 77 | 21.01.99 | |
| 78 | fixed bug in initial config loading | |
| 79 | localstore access is now abstract | |
| 80 | proxy server definition is now separate | |
| 81 | fix null savefile name | |
| 82 | 27.01.99 | |
| 83 | 2 | localstore smartcache (SLOW) |
| 84 | 1 | fixed url= processing and stripping |
| 85 | 2 | 28.01.99 |
| 86 | 1 | depth is an alias for scandepth |
| 87 | upd=noreparse option | |
| 88 | direct load (noproxy) | |
| 89 | 29.01.99 | |
| 90 | frame and http redirect hack | |
| 91 | generic SRC and HREF handler | |
| 92 | version 0.13 | |
| 93 | 30.01.99 | |
| 94 | anysrc support for masks | |
| 95 | fixed bug in & handling | |
| 96 | better redirect handling - settings are inherited too | |
| 97 | and subsequently in , handling | |
| 98 | 2 | version 0.14 |
| 99 | 1 | 6.2.99 |
| 100 | 2 | fixed fatal bug in HTML parser. |
| 101 | 1 | Parser is probably faster now |
| 102 | 2 | v 0.15 |
| 103 | 1 | 27.2.99 |
| 104 | fixed options parser for name=x<crlf> | |
| 105 | depth option in mask now works | |
| 106 | 2 | v 0.16 |
| 107 | ||
| 108 | 15.3.99 Handled error "SmartCache config was not found" | |
| 109 | 1 | inserted defaults from SC |
| 110 | 14.9.99 | |
| 111 | v0.17beta started | |
| 112 | Debianized | |
| 113 | added message config processing ended | |
| 114 | messages are sent to stderr instead of stdout | |
| 115 | stdout will be reserved for URL output | |
| 116 | ||
| 117 | 2 | 30.11.99 |
| 118 | 1 | mask - support for LOG_* |
| 119 | implemented logging support | |
| 120 | ||
| 121 | 1.12.99 | |
| 122 | support for including files on the command line with '@' | |
| 123 | support for 'known URL' via :URL syntax | |
| 124 | configuration changed to #cfgfile | |
| 125 | recursive 'INCLUDE' @ | |
| 126 | added more logging types | |
| 127 | logging now also supports the short form (URLonly) | |
| 128 | ||
| 129 | 11.3.2000 including known URLs via :@list | |
| 130 | ||
| 131 | 1.4.2000 complete rewrite of the localstore part | |
| 132 | ||
| 133 | 11.4.2000 | |
| 134 | Version 0.18 | |
| 135 | localstore rewritten | |
| 136 | fixed bug when we receive Location: with a relative URL | |
| 137 | support for body background | |
| 138 | ||
| 139 | 12.4.2000 | |
| 140 | Started the manual | |
| 141 | fixed nullstore | |
| 142 | localstore now supports indicating whether it is read-only or not | |
| 143 | removed keyword serveralias -> replaced by alias | |
| 144 | size !known | |
| 145 | ||
| 146 | 13.4.2000 | |
| 147 | localstore can load files | |
| 148 | localstore for scache works again | |
| 149 | 14.4.2000 | |
| 150 | support for log=server | |
| 151 | 16.4.2000 | |
| 152 | fixed bug in log | |
| 153 | added support for stored redirects | |
| 154 | ||
| 155 | 24.4.2000 | |
| 156 | Smart Cache store ignores files with filename <> | |
| 157 | Stack dump is printed for exceptions other than IO | |
| 158 | ||
| 159 | 28.5.2000 | |
| 160 | fixed NPE when reading HTTPRC | |
| 161 | ||
| 162 | 23.9.2000 | |
| 163 | support for .cacheinfo VERSION 3 | |
| 164 | <AREA HREF=""> for depth 0 is handled, hopefully correctly | |
| 165 | 2 | |
| 166 | 1 | 27.9.2000 |
| 167 | support for GZIP compression | |
| 168 | version 0.19 | |
| 169 | ||
| 170 | 2 | 30.9.2000 |
| 171 | 1 | threads in the configuration file now works |
| 172 | ||
| 173 | 3.10.2000 | |
| 174 | support for ^URL=additional start location for known url | |
| 175 | command line parameters for URL | |
| 176 | 20.12.2001 | |
| 177 | work on the manual | |
| 178 | 22.12.2001 | |
| 179 | @file may contain spaces to separate arguments | |
| 180 | version 0.20 | |
| 181 | 12.4.2002 | |
| 182 | masks are also copied with configured as | |
| 183 | urls not ending in / are handled correctly | |
| 184 | manual updated | |
| 185 | 15.4.2002 | |
| 186 | manual updated | |
| 187 | 17.4.2002 | |
| 188 | tweaked Makefiles | |
| 189 | manual and readme updated | |
| 190 | version 0.21 | |
| 191 | 2 | 29.4.2002 |
| 192 | 1 | support for BASE HREF tag |
| 193 | depth can be overridden from command line when known sublocation | |
| 194 | is matched | |
| 195 | 10.5.2002 | |
| 196 | priority override on command line works for preconfigured sites | |
| 197 | 11.5.2002 | |
| 198 | released as 0.22 | |
| 199 | 30.7.2002 | |
| 200 | support for new smartcache v4 .cacheinfo files | |
| 201 | released as 0.23 | |
| 202 | 01.12.2002 | |
| 203 | corrected handling of io errors in some cases | |
| 204 | 04.01.2003 | |
| 205 | retry fetching when connection is closed | |
| 206 | 06.01.2003 | |
| 207 | released as 0.24 | |
| 208 | 09.02.2003 | |
| 209 | when connection is closed, do not do INFINITE retries | |
| 210 | 2 | 20.05.2003 |
| 211 | 1 | use cache_dir in scache.cnf |
| 212 | 22.05.2003 | |
| 213 | use cacheroot also in scache.cnf | |
| 214 | released as 0.25 | |
| 215 | 13.10.2003 | |
| 216 | do not retry when URL syntax is bad | |
| 217 | 3.1.2004 | |
| 218 | send real Referer URLs | |
| 219 | 6.1.2004 | |
| 220 | referer= command line option | |
| 221 | Referer location keyword | |
| 222 | 7.1.2004 | |
| 223 | custom user_agent configuration statement | |
| 224 | 8.1.2004 | |
| 225 | -d command line option, all url are default | |
| 226 | released as 0.26 | |
| 227 | 13.2.2004 | |
| 228 | 2 | removed some forgotten debug prints |
| 229 | 1 | 31.3.2007 |
| 230 | updated copyrights, changed to GPLv2 | |
| 231 | 12.4.2007 | |
| 232 | 2 | ported to BSD make and BSD userland tools. |
| 233 | 1 | Now build system works in FreeBSD and Linux |
| 234 | With both BSD and GNU Make | |
| 235 | Manual migrated from Debiandoc SGML to DocBook XML | |
| 236 | 13.4.2007 | |
| 237 | Changed default number of threads to 8, like in Opera | |
| 238 | Changed default user-agent to Firefox 2/Linux | |
| 239 | 2 | Fixed a few typos in manual |
| 240 | Updated included regexp engine to latest version from Smart Cache | |
| 241 | Don't distribute .class files, go for JAR | |
| 242 | whitespace source code cleanup | |
| 243 | 3 | compress distribution zip file with advanced zip instead of info zip |
| 244 | 4 | Fixed crash in nullstore |
| 245 | 5 | Fixed crash when running without config file |
| 246 | 6 | released as 0.27 |
| 247 | 7 | 14.4.2007 |
| 248 | 9 | rename localstore 'null' to 'none' |
| 249 | fixed crash when no defaultmasks were used | |
| 250 | 12 | added support for log=reject with mask url=.... act=reject log=reject |
| 251 | you can extract links to specific mask from crawl | |
| 252 | 14 | added support for crawling delay configurable per-location. |
| 253 | 15 | command line argument and .cnf setting is called 'delay' (seconds) |
| 254 | distribute JAR scloader.jar not loader.jar | |
| 255 | 16 | 15.4.2007 |
| 256 | Version bumped to 0.28 and released. | |
| 257 | 18 | 16.4.2007 |
| 258 | Added support for \ escaping & and , in URLs | |
| 259 | 19 | 25.7.2007 |
| 260 | Delay parameter can take time units like 1.3s, 2m etc | |
| 261 | 23 | Added parameter crawltime (command line too) for limiting time spent |
| 262 | on crawling site | |
| 263 | 24 | Version bumped to 0.29 and released. |
| 264 | 25 | 03.8.2007 |
| 265 | Fixed index out of range crash during stripping server from URL | |
| 266 | 26 | 05.8.2007 |
| 267 | unused function findString removed from htmlscanner.java | |
| 268 | 28 | 08.8.2007 |
| 269 | new server options rememberseen/remembervisited | |
| 270 | new location option extracturl regex replacerx (src tag set to CONTENT) | |
| 271 | log can now trace depth for example log=depth,url | |
| 272 | 30 | Version bumped to 0.30 |
| 273 | 31 | 09.8.2007 |
| 274 | 32 | improved doc about extracturl |
| 275 | oops, forgot to remove debug print | |
| 276 | 34 | fixed activation of local stores; due to a bug the loader almost always |
| 277 | used nullstore | |
| 278 | 35 | send Host: header in HTTP/1.0 requests, not needed when using proxy |
| 279 | (proxy adds it) | |
| 280 | 36 | 10.8.2007 |
| 281 | We now decode page text before starting to extract links. | |
| 282 | HTML decoding code is from http://htmlparser.sf.net | |
| 283 | 38 | Fixed handling of high bit characters in page text. |
| 284 | 39 | Version increased to 0.31 |
| 285 | 42 | Request gzip encoding from server |
| 286 | 43 | Copy extractmasks too when creating new location from template |
| 287 | 44 | Released as 0.31 |
| 288 | 45 | Added support for defaultextracturl |
| 289 | 46 | 01.7.2008 |
| 290 | Support spaces in smart cache configuration file name | |