The amount of published scientific research is simply enormous. Current estimates are over 70 million individual research articles, with around 2 million more being published every year. We are in the midst of an information revolution, with the World Wide Web offering rapid, structured and practical distribution of knowledge. But for researchers, this creates the monolith task of manually finding relevant content to fuel their work, and begs the question, are we doing the best we can to leverage this knowledge?
There are already several well-established searchable archives, scientific databases representing warehouses for all of our knowledge and data. The most well-known include the Web of Science, Scopus, PubMed, and Google Scholar, which together are the de facto mode for current methods of information retrieval. The first two of these are paid services, and attempts to replicate searches between all platforms produce inconsistent results (e.g., Bakkalbasi et al., Kulkarni et al.), raising questions about each of their methods of procurement. The search algorithms for each are also fairly opaque, and the relative reliability of each is quite uncertain. Each of them, though, have their own benefits and pitfalls, which are far better discussed elsewhere (e.g. Falagas et al.).
So where does this leave discoverability for researchers in a world that is becoming more and more ‘open’?