The service is designed to crawl publicly available code from open-source projects, as well as code embedded in web pages, code in compressed archives (.tar.gz, .tar.bz2, .tar, and .zip), and code in CVS and Subversion repositories. Google is not concerned with plagiarism or copyright infringement. “Most of the code is open source so you can reuse it. But I don't think that's the primary use--it's more about how to learn about things and, when you're building open-source packages, to make sure you're doing it the right way,” said Google product manager Tom Stocky.
Stocky also said that Google Source Code filters, indexes, and searches billions of lines of computer source code, giving developers a way to find and access snippets of reusable software. “For a long time it has been sort of an unsolved problem,” added Stocky. “It is hard to find references to this sort of data.”