PageRenderTime 4ms queryTime 99ms sortTime 0ms getByIdsTime 605ms findMatchingLines 113ms

18+ results for 'matt wells gigablast' (4 ms)

Not the results you expected?
bio.html https://github.com/gigablast/open-source-search-engine.git | HTML | 200 lines
                    
69</td><td>
                    
702000. Founded <a href=http://www.gigablast.com/>Gigablast</a>. Bootstrapped with $30k. Developed almost all the code myself. Circa 2006 Gigablast had over 12 billion pages indexed and was the second largest search engine in the world. At one time Gigablast also ranked in the top 2000 most popular websites worldwide. To this date, Gigablast continues to serve millions of queries per day using almost all wind power.
                    
71</td></tr>
                    
74<tr><td>
                    
75<a href=http://searchenginewatch.com/article/2067980/A-Conversation-With-Gigablasts-Matt-Wells><img width=128 height=46 src=/sew.png></a>
                    
76</td><td>2003. Interviewed by Gary Price of  
                    
76</td><td>2003. Interviewed by Gary Price of  
                    
77<a href=http://searchenginewatch.com/article/2067980/A-Conversation-With-Gigablasts-Matt-Wells>Search Engine Watch</a>. In addition to some <a href=http://searchenginewatch.com/article/2063106/Gigablast-Now-Offering-XML-Search-Feeds>followups</a>.
                    
78
                    
128<tr><td>
                    
129<a href=http://www.nmt.edu/nmt-news/16-2007/2960-blasting-into-the-big-leagues-matt-wells-and-gigablast><img src=/bb.png width=128 height=130></a>
                    
130</td><td>
                    
162</td><td>
                    
1632013. The <a href="http://books.google.com/books?id=qLzoWKp2JHcC&pg=PA141&lpg=PA141&dq=gigablast&source=bl&ots=JfwmfpIPKW&sig=sXUFaCsUlxhVfrd2cc4kBKif5LY&hl=en&sa=X&ei=cbn6UfvJB-6GyQGYwICwAg&ved=0CCoQ6AEwADhQ#v=onepage&q=gigablast&f=false">NSA reviewed Gigablast</a> in its book, "Untangling the Web" which was declassified under the Freedom of Information Act.
                    
164</td></tr>
                    
                
developer.html https://github.com/gigablast/open-source-search-engine.git | HTML | 2729 lines
                    
11
                    
12<b>CAUTION: This documentation is old and a lot of it is out of date. -- Matt, May 2014</b>
                    
13<br><br>
                    
20
                    
21<tr><td>I.</td><td><a href="#started">Getting Started</a> - Setting up your PC for Gigablast development.</td></tr>
                    
22<!--subtable-->
                    
31
                    
32<tr><td>IV.</td><td><a href="#hardware">Hardware Administration</a> - Gigablast hardware resources.</td></tr>
                    
33
                    
35
                    
36<tr><td>VI.</td><td><a href="#kernels">Kernels</a> - Kernels used by Gigablast.</td></tr>
                    
37
                    
37
                    
38<tr><td>VII.</td><td><a href="#coding">Coding Conventions</a> - The coding style used at Gigablast.</td></tr>
                    
39
                    
                
CatRec.h https://github.com/gigablast/open-source-search-engine.git | C Header | 527 lines
                    
1// Matt Wells, copyright Jul 201
                    
2
                    
85	// . if we got it by IP and it was banned, admin has the option to
                    
86	//   tell gigablast to automatically add the domain name as banned
                    
87	//   to tagdb in Msg14.cpp
                    
106	/*
                    
107	char* printFormattedRec(char* p);
                    
108	void  printFormattedRec(SafeBuf *sb);
                    
                
PageRoot.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 3434 lines
                    
49	char *rootSecure = "";
                    
50	if ( g_conf.m_isMattWells ) {
                    
51		root       = "http://www.gigablast.com";
                    
51		root       = "http://www.gigablast.com";
                    
52		rootSecure = "https://www.gigablast.com";
                    
53	}
                    
75
                    
76	if ( g_conf.m_isMattWells )
                    
77		sb.safePrintf(" &nbsp; &nbsp; "
                    
204			  //"<b><a href=\"/\"><font color=red>"
                    
205			  //"Powered by Gigablast</font></a></b>"
                    
206			  //"<br>"
                    
246			  //"<b><a href=\"/\"><font color=red>"
                    
247			  //"Powered by Gigablast</font></a></b>"
                    
248			  //"<br>"
                    
                
Parms.h https://github.com/gigablast/open-source-search-engine.git | C Header | 561 lines
                    
1// Matt Wells, copyright Feb 2002
                    
2
                    
40	OBJ_SI          , // SearchInput class
                    
41	OBJ_GBREQUEST   , // for GigablastRequest class of parms
                    
42	OBJ_IR          , // InjectionRequest class from PageInject.h
                    
96
                    
97// generic gigablast request. for all apis offered.
                    
98class GigablastRequest {
                    
294	char  m_group; // start of a new group of controls?
                    
295	// m_priv = 1 means gigablast's software license clients cannot see
                    
296	//            or change.
                    
296	//            or change.
                    
297	// m_priv = 2 means gigablast's software license clients, including
                    
298	//            even metalincs, cannot see or change.
                    
301	//            from quigo so he can set "t2" to something bigger.
                    
302	char  m_priv;  // true if gigablast's software clients cannot see
                    
303	char  m_save;  // save to xml file? almost always true
                    
                
Blaster.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 1285 lines
                    
1// Matt Wells, copyright Sep 2001
                    
2
                    
213		
                    
214		/*if reading a gigablast log file, find the lines that have 
                    
215		  GET and POST commands for search, and register a sleep
                    
305		urlStart+=22;
                    
306		sprintf(tmp,"http://www.gigablast.com/search?%s",urlStart);
                    
307	}
                    
313			end[0]='\0';
                    
314		sprintf(tmp,"http://www.gigablast.com%s",urlStart);
                    
315	}
                    
538	log(LOG_WARN,"blaster: Downloading %s",st->m_u2);
                    
539	//char *ss="www.gigablast.com/search?q=hoopla&code=gbmonitor";
                    
540	//	st->m_u2.set(ss,gbstrlen(ss));
                    
                
monitor.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 730 lines
                    
1// Matt Wells, copyright Feb 2003
                    
2
                    
94		fprintf(stderr,"Usage: monitor www.xyz.com:80 [-h FQHN] [-t]\n");
                    
95		fprintf(stderr,"FQHN defaults to gigablast.com, but if you are"
                    
96			" not monitoring on gigablast's network then you "
                    
96			" not monitoring on gigablast's network then you "
                    
97			"need to set this to like monitor2.gigablast.com "
                    
98			"or whatever your hostname is so verizon accepts our "
                    
131	g_conf.m_sendEmailAlertsToSysadmin    = true;
                    
132	//g_conf.m_sendEmailAlertsToMattAlltell = false;
                    
133	//g_conf.m_sendEmailAlertsToJavier      = false;
                    
158
                    
159	// matt wells
                    
160	// call alltel mail server directly to send to matt in case
                    
160	// call alltel mail server directly to send to matt in case
                    
161	// mail.gigablast.com is down
                    
162	// hey, it already goes directly in m_sendEmailAlertsToMattAlltell
                    
                
HttpServer.h https://github.com/gigablast/open-source-search-engine.git | C Header | 253 lines
                    
1// Copyright Matt Wells Nov 2000
                    
2
                    
148			      int32_t *bytesSent = NULL ); 
                    
149	bool sendErrorReply ( class GigablastRequest *gr );
                    
150	// xml and json uses this
                    
150	// xml and json uses this
                    
151	bool sendSuccessReply ( class GigablastRequest *gr,char *addMsg=NULL);
                    
152	bool sendSuccessReply (TcpSocket *s , char format , char *addMsg=NULL);
                    
152	bool sendSuccessReply (TcpSocket *s , char format , char *addMsg=NULL);
                    
153	// send a "prettier" error reply, formatted in XML if necessary
                    
154	bool sendQueryErrorReply ( TcpSocket *s , int32_t error , char *errmsg,
                    
                
LICENSE https://github.com/gigablast/open-source-search-engine.git | License | 202 lines
                    
178
                    
179   Copyright 2013 Web Research Properties, LLC and Matt Wells and Gigablast, Inc.
                    
180
                    
196file named "seo.cpp", and is not included in this license. If you would like to 
                    
197license that then you can arrange a licensing agreement with Matt Wells.
                    
198
                    
                
Msg13.h https://github.com/gigablast/open-source-search-engine.git | C Header | 212 lines
                    
1// Matt Wells, copyright Oct 2001
                    
2
                    
110	// does m_url represent a FULL http request mime and NOT just a url?
                    
111	// this happens when gigablast is being used like a squid proxy.
                    
112	int32_t  m_isSquidProxiedUrl:1;
                    
                
udptest.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 165 lines
                    
1// Matt Wells, copyright Sep 2001
                    
2
                    
78		// default conf filename
                    
79	char *confFilename = "./gigablast.conf";
                    
80	if ( ! g_conf.init ( confFilename ) ) {
                    
                
HttpRequest.h https://github.com/gigablast/open-source-search-engine.git | C Header | 305 lines
                    
1// Matt Wells, copyright Sep 2001
                    
2
                    
106
                    
107	// for gigablast's own rendering of squid
                    
108	bool m_isSquidProxyRequest;
                    
                
blaster2.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 459 lines
                    
1// Matt Wells, copyright Sep 2001
                    
2
                    
116		fprintf(stderr,"USAGE:  ./blaster queries.fromlog 10 1\n");
                    
117		fprintf(stderr,"USAGE:  ./blaster -r3http://www.gigablast.com/index.php?q= 1 100\n");
                    
118		return 1; 
                    
                
dnstest.cpp https://github.com/gigablast/open-source-search-engine.git | C++ | 152 lines
                    
1// Matt Wells, copyright Jan 2002
                    
2
                    
40	// use default
                    
41	//if ( argc <  3 ) confFilename = "/gigablast/gigablast.conf";
                    
42	// start up log file
                    
                
Log.h https://github.com/gigablast/open-source-search-engine.git | C Header | 176 lines
                    
1// Matt Wells, copyright Feb 2001
                    
2
                    
61// example log:
                    
62//456456454 0 INIT         Gigablast Version 1.234
                    
63//454544444 0 INIT  thread Allocated 435333 bytes for thread stacks.
                    
                
about.html https://github.com/gigablast/open-source-search-engine.git | HTML | 34 lines
                    
8
                    
9<p>As of 2013, Gigablast is one of the remaining four search engines in the United States that maintains its own searchable index of over a billion pages.
                    
10</p>
                    
11<br>
                    
12<p>Founded in 2000, <a href=/bio.html>Matt Wells</a> created Gigablast to index up to 200 Billion pages
                    
13with the least amount of hardware possible. Gigablast provides large-scale,
                    
24<p>
                    
25For more information, <a href=/contact.html>contact Gigablast</a>.
                    
26</P>
                    
                
README.md https://github.com/gigablast/open-source-search-engine.git | Markdown | 44 lines
                    
3
                    
4An open source web and enterprise search engine and spider/crawler. As can be seen on http://www.gigablast.com/ .
                    
5
                    
5
                    
6RUNNING GIGABLAST
                    
7-----------------
                    
12
                    
13Alternatively, visit http://www.gigablast.com/faq.html
                    
14
                    
22
                    
23Alternatively, visit http://www.gigablast.com/developer.html
                    
24
                    
30Contact me for feature requests or help in general. I will work for free
                    
31for good use cases. mattdwells@hotmail.com.
                    
32
                    
                
init.gb.conf https://github.com/gigablast/open-source-search-engine.git | Config | 23 lines
                    
1# Gigablast Search Engine Service
                    
2
                    
3description     "Gigablast Search Engine Service"
                    
4author          "Matt Wells <gigablast@mail.com>"
                    
5
                    
11
                    
12env HOME=/var/gigablast/data0/
                    
13umask 007
                    
17
                    
18# this will read /etc/gigablast/hosts.conf and start up the
                    
19# hosts in there that are local on this machine based on its ip address.
                    
21# port and just exit right away without doing any harm.
                    
22exec /var/gigablast/data0/gb
                    
23
                    
                
 

Source

Language