I need an implementation of the algorithm outlined in the paper <[login to view URL]>. Alternative algorithms will also be accepted provided that the requirements listed below are met. Special consideration will be given to bidders who offer advise or recommendations of algorithms.
****** CHANGE 25/11/2007*****
This system must support multiple languages (English, French, etc.) here it is not required for the coder to implement all of these, only that the object model is modular and allows the different language elements to be easily specified (e.g. stop words, sentance delimiters, stemming algorithm). Initially only English is required. Clearly, the more languages you can initially provide to demonstrate this the better- I suspect many coders would be able to provide English and one other- this is not essential though.
**************************
This algorithm should be written in C# and have the following features:
1) Extraction of n key phrases from the source text (provided as a string). The source text may be provided in different encodings depending on the language- the code should cope with this by allowing the [login to view URL] to be specified.
2) Include a generic interface for stemming algorithms and a version of the porter stemming algorithm for English.
3) Include a generic mechanism for including STOP words for different languages.
4) Provide a flexible mechanism of splitting the source text into sentances to allow for differences in languages.
Bidders should outline their approach to this solution via a list of classes/interfaces or ideally and object model example and give an indication of similar projects or experience.
## Deliverables
All code must be written in C#, ideally in version 3.5 of the .Net framework taking advantage of the facilities provided by this. There must be no reliance on external libraries other than the .Net framework.
1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.
b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.
3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).
* * *This broadcast message was sent to all bidders on Wednesday Nov 28, 2007 6:04:47 AM:
I have removed one of the requirements from the project. I no longer need to differentiate between nouns and verbs. This used to be requirement 4. Thanks
## Platform
.Net 3.5 framework
C#
The solution must be completely free-standing and require no external libraries of any kind except those provided by the .Net framework.