Find Jobs
Hire Freelancers

Modify Vosk API Speech to Text Program

$30-250 USD

Κλειστή
Αναρτήθηκε περίπου 3 χρόνια πριν

$30-250 USD

Πληρωμή κατά την παράδοση
Hi, Background: Vosk API is an offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node. My program: I have a speech to text GUI program using Vosk API that transcripts spoken words to text at the mouse cursors location. It has several features of which I would like to modify and several I would like to implement. Currently, it is using an addon called "Fastpunct" which automatically punctuates the sentences however this causes a huge delay to output the text so I am looking for a different solution. Also, the speech program has a feature to enable "commands" which means whenever you say a form of punctuation such as "Question key" it will translate to "?". Many of the commands are either working periodically or not at all. I would like to speak to someone about possibilities to fix this issue whether it is modifying the existing model or creating a separate one just for the command feature. Furthermore, I have much more future work/ideas that need implemented that we can discuss.
Ταυτότητα εργασίας: 30194656

Σχετικά με την εργασία

6 προτάσεις
Απομακρυσμένη Εργασία
Ενεργός/ή 3 χρόνια πριν

Ψάχνεις τρόπο για να κερδίσεις μερικά χρήματα;

Πλεονεκτήματα πλειοδοσίας στο Freelancer

Καθόρισε τον προϋπολογισμό σου και το χρονοδιάγραμμα
Πληρώσου για τη δουλειά σου
Περίγραψε την πρόταση σου
Η εγγραφή και η πλειοδοσία σε εργασίες είναι δωρεάν
6 freelancers δίνουν μια μέση προσφορά $140 USD για αυτή τη δουλειά
Avatar Χρήστη
Hi Hiring manager I am Natural Language Processing, Speech Recognition and TTS Expert I'm a Masters Student in Natural Language Processing with extensive experience in Deep Learning, NLP, Speech Recognition and Text-to-Speech (TTS). I also have 4 years of full-time work experience as an ML Engineer. My current research work is on Adversarial training of End-to-End Speech Recognition. I am a problem solver and I believe in providing complete solutions. If you have a project in mind and you don't even know where to start, then you've come to the right person. I also have a vast knowledge of Statistical Learning like categorical data analysis, Bayesian statistics, decision trees, cluster analysis, and predictive modelling. I have also managed live production environments on AWS. ML frameworks: Tensorflow, PyTorch, Keras, Chainer, Theano, Apache Spark Python frameworks: numpy, pandas, scikit etc. Programming Languages: Java, Python, R, SQL Visualizations: d3js, ggplot, matplotlib etc. I'm flexible with my working hours and am happy to work closely with any existing freelancers you work with. You can expect prompt, polite and helpful communication from me. I look forward to hearing from you! Thanks
$250 USD σε 2 ημέρες
4,6 (27 αξιολογήσεις)
6,4
6,4
Avatar Χρήστη
Hi, Sir I am very interested in your project. I have the experience for your project. I think that we can discuss the project in chat. Best regards
$200 USD σε 15 ημέρες
4,9 (35 αξιολογήσεις)
6,1
6,1
Avatar Χρήστη
Hello I can modify vosk API speech to text program perfectly per your requirements. I am a professional web developer with rich 8+years of experience in which I have built many websites so far. I have extensive experience in C/C++ and Machine Learning. I have experience in which I have done projects similar as your spec before. I have confidence in your project and can promise you high quality and give you satisfaction. If you are interested in my proposal, please sharing your detailed project with me. Looking forward to hear from you. Thanks.
$50 USD σε 7 ημέρες
5,0 (1 αξιολόγηση)
3,2
3,2
Avatar Χρήστη
I have C/C++ Java etc 8 years experience and have used many API in different commercial applications. Look forward to Anuj
$50 USD σε 1 ημέρα
4,9 (4 αξιολογήσεις)
3,1
3,1
Avatar Χρήστη
❤️❤️❤️❤️❤️ DL Developer ❤️❤️❤️❤️❤️ Hello. Nice to meet you. I have read your job carefully and I am interested in this. I have plenty of experiences with Python libraries(keras, tensorflow, ...). I wish we will discuss more details via private chatting. Best regards. Manpreet. S
$150 USD σε 5 ημέρες
5,0 (2 αξιολογήσεις)
2,1
2,1
Avatar Χρήστη
Hello, I'm very interested in your job as a speech processing engineer, who has many R&D experiences in LVSR(large vocabulary speech recognition) with Kaldi, deep speech2, deep speech, google API, IBM Watson, pocketsphinx. I've deployed many speech recognition applications as offline or streaming versions for many languages like English, Russian, French, Chinese, Spanish. I am also very good at DL like CNN, LSTM, GRU, RNN with PyTorch, and TensorFlow. I have full skill in the development of AI applications with Python, C++, Java on Windows, Linux, AWS. I've provided C++, python binding, the web interface of speech recognition application with Kaldi, wav2letter. I'm also very good at speech processing like speaker identification/verification/diarization, text to speech. I think we can try some voice commands or keyword spotting. And I've ever implemented custom punctuation for medical clinical data. I hope to discuss more it with chat. Thank you
$140 USD σε 7 ημέρες
5,0 (3 αξιολογήσεις)
1,4
1,4

Σχετικά με τον πελάτη

Σημαία της UNITED STATES
Mcdonald, United States
5,0
9
Επαληθευμένη μέθοδος πληρωμής
Μέλος από Αυγ 12, 2016

Επαλήθευση Πελάτη

Ευχαριστούμε! Σου έχουμε στείλει ένα email με ένα σύνδεσμο για να διεκδικήσεις τη δωρεάν πίστωση σου.
Κάτι πήγε στραβά κατά την προσπάθεια αποστολής του email σου. Παρακαλούμε δοκίμασε ξανά.
Εγγεγραμμένοι Χρήστες Συνολικές Αναρτημένες Δουλειές
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Φόρτωση προεπισκόπησης
Δόθηκε πρόσβαση για Geolocation.
Η σύνδεση σου έχει λήξει και τώρα έχεις αποσυνδεθεί. Παρακαλούμε συνδέσου ξανά.