Koders, Krugle, Codase, Google Code Search

I found a Google Talk on a topic that's related to the motto of my blog - "good coders code, great reuse".

In this talk, professor Tao Xie speaks about his research on using public code repositories together with code search engines for finding common API usage patterns and anti-patterns.

His research software uses four code search engines: Koders, Krugle, Codase, and Google Code Search.

He suggests reading Raphael Volz's analysis for more information about these search engines.

Tao has developed three tools, which use the aforementioned search engines:

  • PARSEWeb for finding API usage patterns,
  • XWeb for finding forgotten exception handlers, and
  • NEGWeb for finding misuses of API calls.
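
To make the idea of "usage patterns" and "misuses" a bit more concrete, here is a toy sketch in Python. It is not how PARSEWeb, XWeb, or NEGWeb actually work; it only assumes that each code sample has already been reduced to a list of API call names (the `samples` data and the helper names `call_pairs` and `frequent_pairs` are made up for illustration), and it counts which ordered call pairs show up in many samples.

```python
from collections import Counter


def call_pairs(calls):
    """Return the set of ordered (earlier, later) call pairs in one sample."""
    return {(a, b) for i, a in enumerate(calls) for b in calls[i + 1:] if a != b}


def frequent_pairs(samples, min_support):
    """Count how many samples contain each ordered pair; keep the frequent ones."""
    counts = Counter()
    for calls in samples:
        counts.update(call_pairs(calls))
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Hypothetical call sequences, as if extracted from code search results.
samples = [
    ["fopen", "fread", "fclose"],
    ["fopen", "fgets", "fclose"],
    ["fopen", "fwrite", "fclose"],
    ["fopen", "fread"],  # fclose is missing: a candidate misuse
]

# Pairs seen in at least 3 samples are treated as "common patterns".
patterns = frequent_pairs(samples, min_support=3)

# Samples that call fopen but never call fclose afterwards break the mined pattern.
suspects = [
    s for s in samples
    if "fopen" in s and ("fopen", "fclose") not in call_pairs(s)
]
```

With only four samples the result is fragile: drop one sample and the `fopen` then `fclose` pair no longer reaches the threshold. That is why this kind of mining needs a large pool of code to draw from.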

See the code mining project website for more information.

The lecture is done in a very academic manner and it's very hard to follow. Be sure that you are really interested in this topic before watching it.

Some excerpts from the lecture:

  • [04:26] A problem with data mining on source code is that a single code base might not contain enough data points (API usages) to discover common patterns.
  • [04:58] It is crucial to have a lot of data points to get good results out of data mining.
  • [08:37] Google Code Search indexes publicly hosted SVN and CVS repositories.
  • [09:20] Example of searching for C stdlib's fopen usage on Google Code Search (query: "lang:C file:.c$ fopen\s*\(").
  • [11:08] Example of the same search on Krugle.
  • [16:40] Code search engines return partial code samples. Various heuristics are used for type inference.
  • [22:05] Example of integrating Tao's PARSEWeb into Eclipse.
  • [28:15] Interesting idea of constructing and issuing multiple queries to find more code samples.
  • [36:20] A study showed that properly deallocating resources after an exception resulted in a 17% performance increase.
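
The regular expression part of the query at [09:20] can be tried locally. The sketch below uses Python's `re` module and only covers the `fopen\s*\(` part; `lang:C` and `file:.c$` are search-engine filters and are not modeled here. The sample lines are made up for illustration.

```python
import re

# The regex from the query: the name "fopen", optional whitespace, then "(".
FOPEN_CALL = re.compile(r"fopen\s*\(")

# Hypothetical lines of C source to search through.
lines = [
    'FILE *f = fopen("data.txt", "r");',
    "FILE *g = fopen (path, mode);",
    "/* fopen is declared in stdio.h */",
    "my_fopen_wrapper(path);",
]

# Keep only the lines that look like a call to fopen.
matches = [line for line in lines if FOPEN_CALL.search(line)]

for line in matches:
    print(line)
```

Only the first two lines match. The pattern has no word boundary, so a name such as `my_fopen(` would also match; prefixing the pattern with `\b` is one way to tighten it.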

I'd like to hear some comments on websites that you use for finding code examples!


Utopiah Permalink
July 17, 2008, 05:37

Well, I sent you an email with this video a few weeks ago (June 8th), and I'm glad to see you've put your sum-up online. Why? Because when it comes to easy access to information, learning videos are great, BUT accessing a specific part without metadata is hard. By providing your sum-ups you actually provide anchors into the videos. That helps with direct access, but it also helps with *memorizing*: after seeing a video once, when someone reads your sum-up (as I'm doing now) it re-activates their memories (cf. http://www.wired.com/medtech/health/magazine/16-05/ff_wozniak?currentPage=2 ).


July 17, 2008, 13:52

Really useful stuff! Thanks, I didn't know about this.

Rohaila Jackson Permalink
May 03, 2014, 12:38

I also think that Krugle is not relevant and not very good. I don't have vast knowledge about this, I think I need more information. If you assist me here by posting more about Krugle/Google code search, I will be fully able to complete my research. Please give me online dissertation help here.

Nace Permalink
June 11, 2014, 10:44

Informative update as i was looking for some important points for Google other wise i was going to contact to essay writers UK in this regard.


September 05, 2014, 18:23

Yes you're right . Its a marvelous info for all of us. Thanks Admin.

I think open source code is easily found online on the web and can be understood and applied to any website easily.It is an adaptable code for developers.

January 24, 2015, 09:31

Google has the best strategy for making hot code, but we have to find this hot code. http://www.needpaperhelp.com

January 27, 2015, 20:15

This is a very useful post for our IT students. Another tip for their writing skills.

January 27, 2015, 20:18

In many ways, Information technology can benefit student writing skills, there are a lot to learn from from this post.
