KDE Plans 'Google-like' Search Capabilities 356
CoolFX writes "Developers of KDE have announced plans to simplify searching for files on the open-source Linux desktop environment by adding a Google-style search feature. The next version of KDE, which will either be called 3.4 or 4, is expected to include the new search feature... Aaron Seigo, a KDE developer, said the community has already been discussing and writing code for the new search engine at the KDE Community World Summit."
How will this work? (Score:5, Funny)
Aside from the logistics problems, where the heck am I going to get the pigeons [google.com] anyway?
Re:How will this work? (Score:5, Informative)
Having played with and done some work on the open source Nutch [nutch.org] search engine I know from experience that you can return search results from ~10,000,000 pages in much less than 1 second on a mid-range desktop. It's all done with indexes in much the same way as relational DB have been doing it for years.
Re:How will this work? (Score:5, Insightful)
Re:How will this work? (Score:5, Insightful)
Re:How will this work? (Score:2, Informative)
Re:How will this work? (Score:2, Informative)
Re:How will this work? (Score:3, Insightful)
Now, the French have an entire division of the government devoted to defining the French langua
Re:How will this work? (Score:3, Insightful)
And you need to crank yours up even more. The OED is accepted by many institutions to define the English language. Words in use that are not in the OED are considered slang and/or technical language. While you are correct that there is no royally or federally defined definitive language, the OED is, by common agreement, the definition of the language.
--
Evan
Re: what is "to google" on french? (Score:3, Informative)
googler ('goog-lay'), v.t.
Re:How will this work? (Score:3, Insightful)
When I hear a middle aged waitress (who's not known to be especially computer savvy) use the term "google" as a verb--and in the right context--well, let's just say that google has become a word. Just like xerox, kleenex, or any other widely used trademarks.
Welcome to our lexicon, the word "google".
Re:How will this work? (Score:3, Interesting)
Re:How will this work? (Score:3, Informative)
Did you even bother to try googling [google.com] for it??
"Over the years, Otis dominated the escalator business, but lost the product's trademark. The word escalator lost its proprietary status and its capital "e" in 1950 when the U.S. Patent Office ruled that the word "escalator" had become just a common descriptive term for moving stairways."
Re:How will this work? (Score:4, Insightful)
Re:How will this work? (Score:3, Informative)
I've been waiting for a decent search engine app for the desktop - for both Linux & Windows. Grep is fine, but it's not always the best tool for the job. Alta vista had a decent
Easy.. Your PC is only 3.9e+x Billionth of google (Score:2)
The theory of search is scallable - just the amount of documents on your PC is of such a low scale that even low end PC's would return results quickly.
Re:How will this work? (Score:5, Funny)
Re:How will this work? (Score:5, Insightful)
OTOH I think Hans Reiser has it right, just look at his vision [namesys.com]. Built search from the filesystem up, and it will revolutionize how we think of data.
Great. (Score:5, Funny)
Do they search inside files? (Score:3, Interesting)
An updated law. (Score:4, Funny)
Factor it out, and distribute the task (Score:2)
That's actually perfectly sane for apps that need search capability, as long as:
(i) The search engine facility is factored out of app code so that there is only one instance in the system which can be invoked by any application that needs it; and
(ii) The search engine is distributed in some way, perhaps using a DNS architecture or a P2P one, since most of us don't have machines with the processing nor storage capacity of a Googl
Re:Factor it out, and distribute the task (Score:2, Insightful)
Well, thank you for your insight.
Good news (Score:2)
May I suggest naming the next KDE (Score:2, Funny)
Re:May I suggest naming the next KDE (Score:3, Interesting)
Microsoft had a booth at last year's Borland Developer's Conference, and had basically built a prototype of their file system search running on top of XP. The way it works is actually pretty well described in this interview: http://www.searchengineguide.com/beal/2004/0204_ab 1.html [searchengineguide.com].
Not rocket science, necessarily, but it was v
Re:May I suggest naming the next KDE (Score:2)
Like Spotlight? (Score:5, Interesting)
Re:Like Spotlight? (Score:5, Informative)
Similar, but more pervasive and based on some pretty different models for data collection and ranking. Unfortunately this article hits at a time when there really isn't any reasonable resource for what the plans are, but that should show up somewhere on public mailing lists in the next week or two.
The important differences are that it will be based on a generalized idea linkage inside of the desktop and that it won't be a stand alone tool, but a framework that can be used for having search-centric UIs throughout the desktop.
It was mentioned that this is similar to KMail and JuK at the moment; while I wrote the search code for both of those and that got some of the ideas rolling in my head, this is a pretty big jump from both of those.
Actual Conference Site (Score:2)
Re:Actual Conference Site (Score:5, Informative)
Um, what? (Score:3, Interesting)
Re:Um, what? (Score:2)
wow (Score:4, Interesting)
Ho hum (Score:4, Interesting)
Re:Ho hum (Score:2)
Re:Ho hum (Score:4, Interesting)
Re:Ho hum (Score:3, Interesting)
Google, and Tao (Score:5, Insightful)
It's a whole system, the Google/InterNet/Authors... you can't have parts of it standing alone.
--Mike--
Re:Google, and Tao (Score:3, Insightful)
Re:Google, and Tao (Score:2)
PageRank without links (Score:2)
The pagerank algorithm lives on links, which don't exist on most people's hard drives.
Google itself claims its algorithms work also without links. From their Google Search Appliance FAQ [google.com]:
Google's search algorithms use more than 100 factors to determine the ranking and relevancy of search results, many of them independent of link structure.
So it's worse without links but they had to figure out a way to get subscription fees from companies.
Google is good at this time but if they turn evil, nobody
Re:Google, and Tao (Score:2)
Re:Google, and Tao (Score:2)
What a rip...
Re:Google, and Tao (Score:2)
Re:Google, and Tao (Score:2)
The alogrithms underneeth all of the glizt and glamour of the internet apply to many different searching tasks. The engine may not be the same as "Page-rank" but they've got some really smart people over there at google. If KDE can develop (kdevelop?
Beagle? (Score:5, Insightful)
Re:Beagle? (Score:2)
Well, you could could say the same thing about people who have both KDE and Gnome on their system. I won't find it too hard to believe that two different desktop environments are going to have two different indexing schemes. Just like they have different file managers, defaul
Re:Beagle? (Score:3, Interesting)
Weren't we all slagging of Microsoft for implementing the EXACT same feature in Longhorn, i.e. a databased file system, not all that long ago? But now it's in gnome and kde, it's alright?
Re:Beagle? (Score:5, Informative)
the coolest thing about this is, you get live updates. so for example, if you have the spotlight search open in the top right and you've searched for "kansas," you can then go into a microsoft word doc and type kansas, then save the file. the file will instantly appear in the spotlight search.
i don't think linux has a solution just yet for live-updates, which is important for the integrity of the indexed data.
Why??? (Score:4, Insightful)
no (Score:3, Insightful)
I want to search engine to index my html files, expand
Feature request! (Score:5, Interesting)
Make it so it can be used from the command prompt. Make it so it can be used from GNOME. Make it so it can be used by other non-de X apps. Make it so it can be used by Apache, or Samba, or anything else running under UNIX.
Even better, make it compatible with Spotlight [apple.com]. The search API's are diagrammed at a low enough level that it might be a part of Darwin and not Aqua and thereby released as Free Software. But if it isn't, Apple is pushing Spotlight very hard and they want developers to get behind it and use it, so the specification should be pretty open and reproducable.
Re:Feature request! (Score:4, Interesting)
It will be truly ironic if both Apple and KDE beat MS to this punch. For the first time MS vaporware announcements will have caused actual products from competitiors before the vapor even settled.
Re:filesystem = database is from Beos (Score:3, Interesting)
It's the classic agile and fast mindset vs the monolithic authotarian mindset.
Re:Feature request! (Score:3, Informative)
http://www.gnome.org/projects/beagle/
h
Re:Feature request! (Score:2)
Reality Check (Score:3, Insightful)
But, this has to be done well. I mean, this has to be not desktop implementation centric, but filesystem/kernel centric. That is, in order for this to work really well, you need a filesystem that can categorize files and search through them efficiently (almost like a database).
Reiser4 may be able to do this, WinFS will do this (will have a mssql core), and if this all means a neat kde interface to locate or an extra indexing service, it will suck on linux.
So. It would be really cool if they put it up in freedesktop.org as an RFC so that the whole community may get involved. This cannot be the sole effort of KDE if it is to work well.
Google is fast, but not the best for the desktop (Score:5, Informative)
from Jef Raskin's
The Humane Interface [sourceforge.net]
Part II: WHAT INTERFACES SHOULD HAVE
A useful starting set of solutions to the problems outlined above includes
* A better text search methodology, effective both within a local document or system and with respect to extremely large data spaces such as the web
* A method of eliminating all modal aspects of the basic human-machine interface, a method that is readily learned by newcomers and which is habituating
* An improved navigation method, as applicable to finding your way around within a picture or memo as within a collection of images, documents, or networks; a method which makes use of inborn and learned human navigational skills
* A set of detail improvements to some existing mechanisms that make them consistent with the goals and principles of the rest of the design.
Better text searching requires that the search be extremely fast (the next instance appears within human reaction time), interactive at the typed character (or spoken morpheme) level, and not based on dialog box interaction. You should be able to change the pattern (what you are seeking an instance of) at any time, including during a search. The results should be shown in context and not as a list of documents or sites. A search mechanism that is sufficiently fast and powerful also can serve as a cursor positioning mechanism in text. Such a cursor positioning tool can be significantly faster than graphical pointing devices and can unify local and internetworked information retrieval.
-------------
Well, maybe KDE is not the right project to do that, and I should shut up and help with the project Jef Raskin himself has started, and is slowly being developed, The Humane Environment [sourceforge.net].
Rememberance Agent (Score:3, Interesting)
I had started coding up a Java-based front-end which monitored the X clipboard buffer, but didn't get very far - lack of time. What little code I did write can be found here [clothbot.org].
If the article... (Score:2, Interesting)
i.e. I want to configure my ftp server, so I type in "configure ftp server" and it returns an appropriate help document, or I search "foo" and it will search through my files for that phrase.
that's how I see it working, in any case, not just "find -name filename" because that would be reinventing the wheel...that's what slocate and find are for, or "find files"
search capabilities at the FS level (Score:4, Interesting)
the statement makes no sense (Score:3, Interesting)
Re:the statement makes no sense (Score:2)
"Google" the new version of "enterprise enabled"? (Score:5, Funny)
Duke Nukem: now with Google-style weapons lookup!
Norton Antivirus: Now with Google-style virus lookup!
AutoCad 2005: Now with Google-style component lookup!
Crazy world. Next thing you know they'll be hooking up lava-lamps to build machines.
Re:"Google" the new version of "enterprise enabled (Score:3, Funny)
Very odd quote from RedHat (Score:3, Insightful)
Paul Salazar, the European marketing director at Red Hat, said his company has chosen to focus on Linux on the server rather than on the desktop, due to the fact that it cannot compete with Microsoft's research and development budget.
This is a pretty odd statement coming from a corporation with all its eggs in the Open Source basket. One would normally expect a company to believe in the superiority of its chosen business model.
Surely it's an article of faith that the "virtual equivalent R&D budget" of the FOSS community is hundreds of times greater than Microsoft's corporate R&D budget, and our R&D manpower is thousands of times larger. What FOSS lacks (comparatively) is strong product focus, but many would say that that is a good thing.
Is RedHat having second thoughts, or was this just an unfortunate individual comment from a marketting droid?
Re:Very odd quote from RedHat (Score:2)
You read the article, so you've seen the amount of money they have, versus that spent by MS each year. There's absolutely no way on earth they can compete. I mean, Microsoft (whether you hate them or not
Re:Very odd quote from RedHat (Score:3, Informative)
Redhat's business model is:
That's they're business model and they aren't changing it. They're just focusing their market. In the desktop market, parts 1&2 don't allow them to compete with MS. The features needed to compete don't exist in the OS, and RH doesn'
Looks a bit... (Score:2, Insightful)
As for its comparison to WinFS if MS get it right. Which
Background (Score:5, Insightful)
"Google like" here means just "searching", but the result will in fact be more like WinFS than Google in that it is using file data and file metadata to index and find things. Interface-wise expect more quicksearch bars like the one in Kmail 1.7 (KDE 3.3.0, Till Adam) and JuK (Scott Wheeler).
See also a Blog entry of mine [koehntopp.de] (german language) in the same vein.
Where are the breakthroughs? (Score:5, Interesting)
I'm sure that it's being done to some extent, I would think that if you're a Phd doing advanced windowing research, you'd want your platform to be Linux so that you can code it the way you want.
While Linux is the natural choice to use for the breakthrough concepts, I really don't know of any. While Linux has *great* technology, and is definately an OS par excellence, it feels like it's more-or-less keeping up with the Joneses, instead of leading in new ideas and technologies. It's said that everyone waits for Apple to come up with something so that it can be copied. Well, why wait for them?
Maybe there isn't as much research going on as I would think (not being in Academia), or it's more of the "faster-smaller" variety, but when the "next big thing" happens in computing, I hope it is on Linux *first*.
Re:Where are the breakthroughs? (Score:5, Insightful)
I don't think Linux is a natural choice for breakthrough concepts in user interfaces. It is in lots of other areas and maybe in more basic research in UI but it's not user driven as much as OS X and Windows.
Both KDE and Gnome are good at constantly progressing, trying new things. And they're good at listening to users. But, I think they don't have the pressure/motivation and resources right now to come up with something truly novel.
I would guess that this partly because they're under some pressure to provide the functionality in OS X or Windows. They're playing catch-up almost constantly. Also, their flexibility seems to slow them down. There were big changes in Gnome 2.0 but they seemed more like a change in direction than movement forward. My impression of KDE is that they put most effort in the backend of the system where you're not likely to notice it. And both projects work hard to try to make everyone happy.
My impression of Apple is that Apple (Jobs) likes to think big picture and then throw lot of effort behind a handful of projects. The media hub picture spawned the iApps and iPod. The interface picture birthed Expose, Spotlight and that widget thing. MS seems to try to do (or at least promise) everything then implement it poorly, then keep plugging away at it until they get it right or give up. But they do get it right sometimes and they do try make things a 'better experience' for the user. I personally think they miss the mark more times than not because they burden their innovations with user-hostile elements like DRM.
OSS needs visionaries but to implement a vision you need everyone to get behind it. I think that's harder in OSS because visionaries seem a bit dictatorial. It's not impossible, I'm sure, but going from a mainly academic research project to something people can use is hard and probably needs a steady guiding hand.
Re:Where are the breakthroughs? (Score:3, Interesting)
Directional problem (Score:4, Insightful)
The functionality of the K-apps and how loosely they integrate (and don't integrate with anything else) is at about MS Office 95 level.
I see searchings importance on a scale of 1 to 10 at about a 3. Most users I know have everything in one folder, maybe a couple of nested folders and that's it. Not too hard to find stuff if there's one or two places to look.
-m
PS: Was there an article to read?
It's just journalists (Score:3, Interesting)
By the way the next version of KDE will be KDE 3.4, branching to KDE 4 when Qt 4 beta is available at the end of the year.
Transcripts from all the talks I went to are at http://conference2004.kde.org/sched-devconf.php [kde.org].
Jonathan Riddell
"KDE goes for IPO selling 145,233 shares at 1059,342euro each giving KDE a higher market capitalisation than Microsoft and AOL combined."
I'm feeling lucky.. (Score:4, Funny)
* note: not actually random, void where prohibited...
YES! (Score:3, Insightful)
Hierarchical trees are horrible ways to manage data, especially if it's a bunch of data that can be classified multiple ways and you typically won't remember everything you save.
There's no reason why
This would solve a lot of the hassle of organizing files. The only choice I'd have to make is how specific I want to get when choosing file names and directories.
Indexing of file contents is an added plus, but not even necessary for a huge gain in organization.
Time for the usual really bad name (Score:4, Funny)
Possible names that will sound cool only to the geekly-enabled:
- Dingo
- Dingus
- HeadCheez
- Buffy
- MK-47
- FUALL V56.34
- All-seeing crystal of Gompfor (unique artifact)
- STDcipher
Seriously, someone call a professional here...
You can already use google to search your computer (Score:3, Funny)
Simple solution: put all your files into your public_html folder!
I Think I Want A Semantic Desktop (Score:3, Interesting)
Not in terms of a filesystem, and not in terms of a tool that indexes everything and points a search engine at the index. Rather, I want something that overlays all that and imposes structure and organization on the information and knowledge that is recorded in all the those data.
Having done that, give me the tools to browse and manipulate it.
Gnome's Dashboard seems to be sniffing around the edges of this notion. But I'm thinking of something more significant, something that might potentially represent the user's concept of the machine, relegating filesystens, files, and data formats to lower and less relevant levels.
I.e. computer users do not need to be aware of the actual structure of their hard drive, or how their chips and circuit boards operate, because the OS and other software abstract all that out of the way. Why not bump it up?
Re:Speaking of bloat... (Score:5, Insightful)
It'll be part of KDE - where DE stands for Desktop Environment. KDE is much more than a window manager, it's an entire desktop system, so this is the perfect place to add it.
Re:Speaking of bloat... (Score:2)
Re:Speaking of bloat... (Score:3, Interesting)
Kfind (Score:2)
I totally agree. They would be just as well to adapt 'find' to a little doc app. Call it kfind. The hype surrounding Google is making the KDE folks do some silly things. Can't say I care what direction they go though. I use WindowMaker.
Re:Speaking of bloat... (Score:3, Interesting)
No, programs do it too.
"What other project could it possibly go under?"
It would be nicer if it were part of the filesystem. "Everything is a file" is a powerful concept.
find
Re:find -name myfilename (Score:2)
Re:find -name myfilename (Score:4, Funny)
doesn't work unless you remember to name files consistently.
Re:find -name myfilename (Score:3, Interesting)
Ok, so take your pick of Google results:
or
BBN had some interesting natural language parsing projects going in the early 90s, but google has
Re:find -name myfilename (Score:3, Insightful)
Re:find -name myfilename (Score:3, Informative)
Re:WinFS (Score:2)
Re:WinFS (Score:2)
Re:(karma burn) Re:F[sk]KING ADMINS (Score:3, Insightful)
Even Apple, supposed masters of great design, recognize a loser when they see one--aft
Re:FUCKING ADMINS (Score:3, Interesting)
Re:Something like the deskbar? (Score:2)
Re:Something like the deskbar? (Score:3, Informative)
Examples:
imdb:Rob Malda (Internet Movie Database)
gg:lucky (Google)
dict:sphygmomanometer (Merriam-Webster)
Re:Google (Score:3, Informative)
Re:Google (Score:2)
Re:Get your priorities straight (Score:3, Insightful)
Re:stick with grep (Score:3, Informative)
Somebody else suggesting just using "find / -exec grep -H pattern {} \;" but that would be a terrible solution. It isn't indexed. To find a unique pattern, it forces a scan of half the drive on average. Come on. Looking at half the data on the drive ju