Modify Vosk API Speech to Text Program

$30-250 USD

Closed

Posted about 3 years ago

Paid on delivery

Hi,

Background: Vosk API is an offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node.

My program: I have a speech-to-text GUI program using Vosk API that transcribes spoken words to text at the mouse cursor's location. It has several features I would like to modify and several more I would like to implement. Currently it uses an add-on called "Fastpunct", which automatically punctuates sentences; however, this causes a huge delay before the text is output, so I am looking for a different solution.

The program also has a feature to enable "commands": whenever you say a form of punctuation such as "Question key", it is translated to "?". Many of the commands work only intermittently or not at all. I would like to speak to someone about ways to fix this issue, whether by modifying the existing model or by creating a separate one just for the command feature.

Furthermore, I have much more future work and many ideas that need to be implemented, which we can discuss.
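One way the "commands" feature described above could work is as a post-processing step on the recognized text, independent of the acoustic model. The sketch below is only an illustration under stated assumptions: the `COMMANDS` table and the `apply_commands` function are hypothetical names, and only the "Question key" mapping comes from the description; the other entries are invented examples.

```python
import re

# Hypothetical command table: spoken phrase -> punctuation mark.
# Only "question key" is taken from the job description; the
# other entries are illustrative assumptions.
COMMANDS = {
    "question key": "?",
    "period key": ".",
    "comma key": ",",
}


def apply_commands(text, commands=COMMANDS):
    """Replace spoken command phrases with their punctuation marks.

    The whitespace before a command is consumed so the mark attaches
    to the preceding word. Matching is case-insensitive.
    """
    # Try longer phrases first so overlapping commands resolve predictably.
    phrases = sorted(commands, key=len, reverse=True)
    pattern = re.compile(
        r"\s*\b(" + "|".join(re.escape(p) for p in phrases) + r")\b",
        flags=re.IGNORECASE,
    )
    return pattern.sub(lambda m: commands[m.group(1).lower()], text).strip()


if __name__ == "__main__":
    print(apply_commands("hello comma key how are you question key"))
```

A text-level mapping like this only helps when the recognizer already outputs the command words correctly; it does not address commands that are misrecognized in the first place.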

Machine Learning (ML)

Project ID: 30194656

About the project

6 proposals

Remote project

Active 3 years ago

6 freelancers are bidding an average of $140 USD for this job

@Devrits

Hi Hiring Manager,

I am a Natural Language Processing, Speech Recognition and TTS expert. I'm a Master's student in Natural Language Processing with extensive experience in Deep Learning, NLP, Speech Recognition and Text-to-Speech (TTS). I also have 4 years of full-time work experience as an ML Engineer. My current research is on adversarial training of end-to-end speech recognition.

I am a problem solver and I believe in providing complete solutions. If you have a project in mind and you don't even know where to start, then you've come to the right person. I also have broad knowledge of statistical learning, including categorical data analysis, Bayesian statistics, decision trees, cluster analysis, and predictive modelling. I have also managed live production environments on AWS.

ML frameworks: TensorFlow, PyTorch, Keras, Chainer, Theano, Apache Spark
Python frameworks: numpy, pandas, scikit etc.
Programming languages: Java, Python, R, SQL
Visualizations: d3js, ggplot, matplotlib etc.

I'm flexible with my working hours and am happy to work closely with any existing freelancers you work with. You can expect prompt, polite and helpful communication from me. I look forward to hearing from you! Thanks

$250 USD in 2 days

4.6

(27 reviews)

6.4

@kevinlee1238

Hi Sir, I am very interested in your project and have the experience it requires. I think we can discuss the project in chat. Best regards

$200 USD in 15 days

4.9

(35 reviews)

6.1

@Mikhailpopov0724

Hello, I can modify your Vosk API speech-to-text program to your requirements. I am a professional web developer with 8+ years of experience, during which I have built many websites. I have extensive experience in C/C++ and Machine Learning, and I have done projects similar to your spec before. I am confident about your project and can promise you high quality and satisfaction. If you are interested in my proposal, please share your detailed project with me. Looking forward to hearing from you. Thanks.

$50 USD in 7 days

5.0

(1 review)

3.2

@tracygearth

I have 8 years of experience with C/C++, Java, etc. and have used many APIs in different commercial applications. Look forward to Anuj

$50 USD in 1 day

4.9

(4 reviews)

3.1

@Manpreetsweden

❤️❤️❤️❤️❤️ DL Developer ❤️❤️❤️❤️❤️ Hello. Nice to meet you. I have read your job description carefully and I am interested in it. I have plenty of experience with Python libraries (Keras, TensorFlow, ...). I hope we can discuss more details via private chat. Best regards. Manpreet. S

$150 USD in 5 days

5.0

(2 reviews)

2.1

@OleksandLitkina

Hello, I'm very interested in your job. I am a speech processing engineer with extensive R&D experience in LVSR (large vocabulary speech recognition) with Kaldi, Deep Speech 2, Deep Speech, Google API, IBM Watson and PocketSphinx. I've deployed many speech recognition applications, in both offline and streaming versions, for languages such as English, Russian, French, Chinese and Spanish. I am also very good at deep learning (CNN, LSTM, GRU, RNN) with PyTorch and TensorFlow, and I am fully skilled in developing AI applications with Python, C++ and Java on Windows, Linux and AWS. I've provided C++ and Python bindings and web interfaces for speech recognition applications built with Kaldi and wav2letter. I'm also very good at speech processing tasks such as speaker identification/verification/diarization and text-to-speech. I think we can try voice commands or keyword spotting. I have also previously implemented custom punctuation for medical clinical data. I hope to discuss this further in chat. Thank you

$140 USD in 7 days
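The voice-command idea raised in the last proposal could be tried by restricting recognition to a small phrase list. The sketch below only builds that list; `build_command_grammar` is a hypothetical helper, and the phrases other than "Question key" are invented. Vosk's Python examples pass a JSON list of phrases (with "[unk]" as the catch-all) as an optional third argument to `KaldiRecognizer`. Whether the model used in this project supports such a runtime grammar is an assumption, so that call is shown only as a comment.

```python
import json


def build_command_grammar(phrases):
    """Return a JSON list of unique lowercase phrases plus "[unk]".

    "[unk]" stands for any speech that matches none of the phrases.
    """
    unique = sorted({p.strip().lower() for p in phrases if p.strip()})
    return json.dumps(unique + ["[unk]"])


# Illustrative phrases; only "Question key" comes from the job description.
grammar = build_command_grammar(["Question key", "comma key"])

# Assumed usage with Vosk installed and a model that supports grammars:
#   from vosk import Model, KaldiRecognizer
#   rec = KaldiRecognizer(Model("path/to/model"), 16000, grammar)

if __name__ == "__main__":
    print(grammar)
```

A separate command-only recognizer along these lines would run alongside the dictation recognizer, which matches the "separate model just for the command feature" option mentioned in the job description.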