16+ results for 'matt wells gigablast' (3565 ms)
1// Matt Wells, copyright Jul 201 2 85 // . if we got it by IP and it was banned, admin has the option to 86 // tell gigablast to automatically add the domain name as banned 87 // to tagdb in Msg14.cpp 106 /* 107 char* printFormattedRec(char* p); 108 void printFormattedRec(SafeBuf *sb);PageRoot.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 3434 lines
49 char *rootSecure = ""; 50 if ( g_conf.m_isMattWells ) { 51 root = "http://www.gigablast.com"; 51 root = "http://www.gigablast.com"; 52 rootSecure = "https://www.gigablast.com"; 53 } 75 76 if ( g_conf.m_isMattWells ) 77 sb.safePrintf(" " 204 //"<b><a href=\"/\"><font color=red>" 205 //"Powered by Gigablast</font></a></b>" 206 //"<br>" 246 //"<b><a href=\"/\"><font color=red>" 247 //"Powered by Gigablast</font></a></b>" 248 //"<br>"Parms.h https://github.com/gigablast/open-source-search-engine.git | C Header | 561 lines
1// Matt Wells, copyright Feb 2002 2 40 OBJ_SI , // SearchInput class 41 OBJ_GBREQUEST , // for GigablastRequest class of parms 42 OBJ_IR , // InjectionRequest class from PageInject.h 96 97// generic gigablast request. for all apis offered. 98class GigablastRequest { 294 char m_group; // start of a new group of controls? 295 // m_priv = 1 means gigablast's software license clients cannot see 296 // or change. 296 // or change. 297 // m_priv = 2 means gigablast's software license clients, including 298 // even metalincs, cannot see or change. 301 // from quigo so he can set "t2" to something bigger. 302 char m_priv; // true if gigablast's software clients cannot see 303 char m_save; // save to xml file? almost always trueBlaster.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 1285 lines
1// Matt Wells, copyright Sep 2001 2 213 214 /*if reading a gigablast log file, find the lines that have 215 GET and POST commands for search, and register a sleep 305 urlStart+=22; 306 sprintf(tmp,"http://www.gigablast.com/search?%s",urlStart); 307 } 313 end[0]='\0'; 314 sprintf(tmp,"http://www.gigablast.com%s",urlStart); 315 } 538 log(LOG_WARN,"blaster: Downloading %s",st->m_u2); 539 //char *ss="www.gigablast.com/search?q=hoopla&code=gbmonitor"; 540 // st->m_u2.set(ss,gbstrlen(ss));monitor.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 730 lines
1// Matt Wells, copyright Feb 2003 2 94 fprintf(stderr,"Usage: monitor www.xyz.com:80 [-h FQHN] [-t]\n"); 95 fprintf(stderr,"FQHN defaults to gigablast.com, but if you are" 96 " not monitoring on gigablast's network then you " 96 " not monitoring on gigablast's network then you " 97 "need to set this to like monitor2.gigablast.com " 98 "or whatever your hostname is so verizon accepts our " 131 g_conf.m_sendEmailAlertsToSysadmin = true; 132 //g_conf.m_sendEmailAlertsToMattAlltell = false; 133 //g_conf.m_sendEmailAlertsToJavier = false; 158 159 // matt wells 160 // call alltel mail server directly to send to matt in case 160 // call alltel mail server directly to send to matt in case 161 // mail.gigablast.com is down 162 // hey, it already goes directly in m_sendEmailAlertsToMattAlltellbio.html https://github.com/gigablast/open-source-search-engine.git | HTML | 200 lines
69</td><td> 702000. Founded <a href=http://www.gigablast.com/>Gigablast</a>. Bootstrapped with $30k. Developed almost all the code myself. Circa 2006 Gigablast had over 12 billion pages indexed and was the second largest search engine in the world. At one time Gigablast also ranked in the top 2000 most popular websites worldwide. To this date, Gigablast continues to serve millions of queries per day using almost all wind power. 71</td></tr> 74<tr><td> 75<a href=http://searchenginewatch.com/article/2067980/A-Conversation-With-Gigablasts-Matt-Wells><img width=128 height=46 src=/sew.png></a> 76</td><td>2003. Interviewed by Gary Price of 76</td><td>2003. Interviewed by Gary Price of 77<a href=http://searchenginewatch.com/article/2067980/A-Conversation-With-Gigablasts-Matt-Wells>Search Engine Watch</a>. In addition to some <a href=http://searchenginewatch.com/article/2063106/Gigablast-Now-Offering-XML-Search-Feeds>followups</a>. 78 128<tr><td> 129<a href=http://www.nmt.edu/nmt-news/16-2007/2960-blasting-into-the-big-leagues-matt-wells-and-gigablast><img src=/bb.png width=128 height=130></a> 130</td><td> 162</td><td> 1632013. The <a href="http://books.google.com/books?id=qLzoWKp2JHcC&pg=PA141&lpg=PA141&dq=gigablast&source=bl&ots=JfwmfpIPKW&sig=sXUFaCsUlxhVfrd2cc4kBKif5LY&hl=en&sa=X&ei=cbn6UfvJB-6GyQGYwICwAg&ved=0CCoQ6AEwADhQ#v=onepage&q=gigablast&f=false">NSA reviewed Gigablast</a> in its book, "Untangling the Web" which was declassified under the Freedom of Information Act. 164</td></tr>HttpServer.h https://github.com/gigablast/open-source-search-engine.git | C Header | 253 lines
1// Copyright Matt Wells Nov 2000 2 148 int32_t *bytesSent = NULL ); 149 bool sendErrorReply ( class GigablastRequest *gr ); 150 // xml and json uses this 150 // xml and json uses this 151 bool sendSuccessReply ( class GigablastRequest *gr,char *addMsg=NULL); 152 bool sendSuccessReply (TcpSocket *s , char format , char *addMsg=NULL); 152 bool sendSuccessReply (TcpSocket *s , char format , char *addMsg=NULL); 153 // send a "prettier" error reply, formatted in XML if necessary 154 bool sendQueryErrorReply ( TcpSocket *s , int32_t error , char *errmsg,Msg13.h https://github.com/gigablast/open-source-search-engine.git | C Header | 212 lines
1// Matt Wells, copyright Oct 2001 2 110 // does m_url represent a FULL http request mime and NOT just a url? 111 // this happens when gigablast is being used like a squid proxy. 112 int32_t m_isSquidProxiedUrl:1;udptest.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 165 lines
1// Matt Wells, copyright Sep 2001 2 78 // default conf filename 79 char *confFilename = "./gigablast.conf"; 80 if ( ! g_conf.init ( confFilename ) ) {HttpRequest.h https://github.com/gigablast/open-source-search-engine.git | C Header | 305 lines
1// Matt Wells, copyright Sep 2001 2 106 107 // for gigablast's own rendering of squid 108 bool m_isSquidProxyRequest;blaster2.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 459 lines
1// Matt Wells, copyright Sep 2001 2 116 fprintf(stderr,"USAGE: ./blaster queries.fromlog 10 1\n"); 117 fprintf(stderr,"USAGE: ./blaster -r3http://www.gigablast.com/index.php?q= 1 100\n"); 118 return 1;Log.h https://github.com/gigablast/open-source-search-engine.git | C Header | 176 lines
1// Matt Wells, copyright Feb 2001 2 61// example log: 62//456456454 0 INIT Gigablast Version 1.234 63//454544444 0 INIT thread Allocated 435333 bytes for thread stacks.dnstest.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 152 lines
1// Matt Wells, copyright Jan 2002 2 40 // use default 41 //if ( argc < 3 ) confFilename = "/gigablast/gigablast.conf"; 42 // start up log fileREADME.md https://github.com/gigablast/open-source-search-engine.git | Markdown | 44 lines
3 4An open source web and enterprise search engine and spider/crawler. As can be seen on http://www.gigablast.com/ . 5 5 6RUNNING GIGABLAST 7----------------- 12 13Alternatively, visit http://www.gigablast.com/faq.html 14 22 23Alternatively, visit http://www.gigablast.com/developer.html 24 30Contact me for feature requests or help in general. I will work for free 31for good use cases. mattdwells@hotmail.com. 32about.html https://github.com/gigablast/open-source-search-engine.git | HTML | 34 lines
8 9<p>As of 2013, Gigablast is one of the remaining four search engines in the United States that maintains its own searchable index of over a billion pages. 10</p> 11<br> 12<p>Founded in 2000, <a href=/bio.html>Matt Wells</a> created Gigablast to index up to 200 Billion pages 13with the least amount of hardware possible. Gigablast provides large-scale, 24<p> 25For more information, <a href=/contact.html>contact Gigablast</a>. 26</P>init.gb.conf https://github.com/gigablast/open-source-search-engine.git | Config | 23 lines
1# Gigablast Search Engine Service 2 3description "Gigablast Search Engine Service" 4author "Matt Wells <gigablast@mail.com>" 5 11 12env HOME=/var/gigablast/data0/ 13umask 007 17 18# this will read /etc/gigablast/hosts.conf and start up the 19# hosts in there that are local on this machine based on its ip address. 21# port and just exit right away without doing any harm. 22exec /var/gigablast/data0/gb 23