title
Python Jarvis Tutorial - Part 1 | Creating Voice Assistant Using Python Speech Recognition | Edureka

description
馃敟Edureka Python for Data Science: https://www.edureka.co/data-science-python-certification-course This Edureka video on Python Jarvis Tutorial - Part 1 ( Part 2 - https://youtu.be/7Mwcmo1o_ik ) helps you to understand how we can make a speech recognition enabled bot for automating tasks. The following topics are discussed in the session: Idea behind Jarvis The Objective of The Project How To Convert Speech To Text How To Convert Text To Speech ---------------------------------------------- 馃敶Subscribe to our channel to get video updates. Hit the subscribe button above: https://goo.gl/6ohpTV Instagram: https://www.instagram.com/edureka_lea... Facebook: https://www.facebook.com/edurekaIN/ Twitter: https://twitter.com/edurekain LinkedIn: https://www.linkedin.com/company/edureka Telegram: https://t.me/edurekaupdates SlideShare: https://www.slideshare.net/EdurekaIN Meetup: https://www.meetup.com/edureka/ #Edureka #PythonEdureka #Pythonjarvis #pythonprojects #pythonprogramming #pythontutorial #PythonTraining - - - - - - - - - - - - - - - - - About the Course Edureka's Python Online Certification Training will make you an expert in Python programming. It will also help you learn Python the Big data way with integration of Machine learning, Pig, Hive and Web Scraping through beautiful soup. During our Python Certification training, our instructors will help you: 1. Master the Basic and Advanced Concepts of Python 2. Understand Python Scripts on UNIX/Windows, Python Editors and IDEs 3. Master the Concepts of Sequences and File operations 4. Learn how to use and create functions, sorting different elements, Lambda function, error handling techniques and Regular expressions ans using modules in Python 5. Gain expertise in machine learning using Python and build a Real Life Machine Learning application 6. Understand the supervised and unsupervised learning and concepts of Scikit-Learn 7. Master the concepts of MapReduce in Hadoop 8. Learn to write Complex MapReduce programs 9. Understand what is PIG and HIVE, Streaming feature in Hadoop, MapReduce job running with Python 10. Implementing a PIG UDF in Python, Writing a HIVE UDF in Python, Pydoop and/Or MRjob Basics 11. Master the concepts of Web scraping in Python 12. Work on a Real Life Project on Big Data Analytics using Python and gain Hands on Project Experience - - - - - - - - - - - - - - - - - - - Why learn Python? Programmers love Python because of how fast and easy it is to use. Python cuts development time in half with its simple to read syntax and easy compilation feature. Debugging your programs is a breeze in Python with its built in debugger. Using Python makes Programmers more productive and their programs ultimately better. Python continues to be a favorite option for data scientists who use it for building and using Machine learning applications and other scientific computations. Python runs on Windows, Linux/Unix, Mac OS and has been ported to Java and .NET virtual machines. Python is free to use, even for the commercial products, because of its OSI-approved open source license. Python has evolved as the most preferred Language for Data Analytics and the increasing search trends on python also indicates that Python is the next "Big Thing" and a must for Professionals in the Data Analytics domain. - - - - - - - - - - - - - - - - - - - Who should go for python? Edureka鈥檚 Data Science certification course in Python is a good fit for the below professionals: 路 Programmers, Developers, Technical Leads, Architects 路 Developers aspiring to be a 鈥楳achine Learning Engineer' 路 Analytics Managers who are leading a team of analysts 路 Business Analysts who want to understand Machine Learning (ML) Techniques 路 Information Architects who want to gain expertise in Predictive Analytics 路 'Python' professionals who want to design automatic predictive models For more information, Please write back to us at sales@edureka.in or call us at IND: 9606058406/ US: 18338555775 (toll free)

detail
{'title': 'Python Jarvis Tutorial - Part 1 | Creating Voice Assistant Using Python Speech Recognition | Edureka', 'heatmap': [{'end': 332.095, 'start': 295.273, 'weight': 0.937}, {'end': 621.649, 'start': 583.133, 'weight': 1}], 'summary': "Learn to create a speech-enabled assistant like jarvis in python, covering ai and ml in speech recognition, setting up a speech recognition project using anaconda environment, implementing speech recognition and text-to-speech, and automating music playback with error handling and language recognition, offered by edureka's tutorial.", 'chapters': [{'end': 83.421, 'segs': [{'end': 83.421, 'src': 'embed', 'start': 38.505, 'weight': 0, 'content': [{'end': 40.327, 'text': "Let's take a look at the agenda for this session.", 'start': 38.505, 'duration': 1.822}, {'end': 47.815, 'text': "So firstly, I'll start with the basic idea behind Jarvis and then I will discuss the developments around the idea moving further.", 'start': 40.828, 'duration': 6.987}, {'end': 50.358, 'text': 'I will explain the objective of our project.', 'start': 47.835, 'duration': 2.523}, {'end': 54.763, 'text': 'And finally I will explain how we convert the speech to text and vice versa.', 'start': 50.638, 'duration': 4.125}, {'end': 59.706, 'text': 'I hope you are clear with the agenda and this is part one of the session.', 'start': 56.084, 'duration': 3.622}, {'end': 68.612, 'text': 'So I will focus on the speech recognition and text to speech part here and we will incorporate the automations or task in the next session.', 'start': 60.027, 'duration': 8.585}, {'end': 77.437, 'text': 'And if you are new here, subscribe to Erica for more exciting tutorials and press the bell icon to get the latest updates on Eddie Raker.', 'start': 69.372, 'duration': 8.065}, {'end': 80.859, 'text': "and do check out Eddie Rika's AI ML certification program.", 'start': 77.437, 'duration': 3.422}, {'end': 83.421, 'text': 'The link is given in the description box below.', 'start': 81.22, 'duration': 2.201}], 'summary': 'Session covers jarvis development, speech-to-text, and text-to-speech conversion with future plans for automation. subscribe to erica for updates on eddie raker and ai ml certification program.', 'duration': 44.916, 'max_score': 38.505, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ38505.jpg'}], 'start': 6.496, 'title': 'Creating speech-enabled assistant', 'summary': 'Covers creating a speech-enabled assistant like jarvis in python, discussing the agenda, project objectives, incorporating speech recognition and text to speech, and includes a call to action to subscribe to edureka for more tutorials and explore their ai ml certification program.', 'chapters': [{'end': 83.421, 'start': 6.496, 'title': 'Creating speech-enabled assistant like jarvis', 'summary': 'Covers creating a speech-enabled assistant like jarvis in python, including discussing the agenda, project objectives, and incorporating speech recognition and text to speech, with a call to action to subscribe to edureka for more tutorials and explore their ai ml certification program.', 'duration': 76.925, 'highlights': ['Explaining the objective of the project, discussing developments around the idea, and converting speech to text and vice versa.', 'Focusing on speech recognition and text to speech, and incorporating automations or tasks in the next session.', 'Encouraging new viewers to subscribe to Edureka for more tutorials and explore their AI ML certification program.']}], 'duration': 76.925, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ6496.jpg', 'highlights': ['Discuss project objective, developments, and speech conversion', 'Focus on speech recognition, text to speech, and future automations', 'Encourage viewers to subscribe to Edureka for tutorials and AI ML program']}, {'end': 294.892, 'segs': [{'end': 116.857, 'src': 'embed', 'start': 84.053, 'weight': 4, 'content': [{'end': 86.694, 'text': "Now without any further ado, let's start our session.", 'start': 84.053, 'duration': 2.641}, {'end': 92.877, 'text': 'So the idea of Jarvis was picked up from a fictional movie of the Marvel Cinematic Universe.', 'start': 87.895, 'duration': 4.982}, {'end': 97.759, 'text': "I'm sure most of us must be aware of the character in the Marvel Cinematic Universe.", 'start': 93.417, 'duration': 4.342}, {'end': 105.943, 'text': 'That is Iron Man and where the protagonist has a AI powered bot who is able to do tasks on the mere commands.', 'start': 98.279, 'duration': 7.664}, {'end': 109.811, 'text': 'So sometime Tony Stark who is the protagonist of the film.', 'start': 106.969, 'duration': 2.842}, {'end': 116.857, 'text': "He actually has a board, which is called Jarvis in the movie, and basically he's able to give commands to that board,", 'start': 110.292, 'duration': 6.565}], 'summary': 'The idea of jarvis was inspired by the ai bot in the marvel cinematic universe, particularly in the movie iron man.', 'duration': 32.804, 'max_score': 84.053, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ84053.jpg'}, {'end': 165.624, 'src': 'embed', 'start': 126.965, 'weight': 1, 'content': [{'end': 133.711, 'text': 'And it actually gained a lot of popularity and curiosity among developers and organizations to actually make it a reality.', 'start': 126.965, 'duration': 6.746}, {'end': 142.986, 'text': "and it wasn't long until people started to get breakthrough achievements, and then the AI and ML, or artificial intelligence and machine learning,", 'start': 134.562, 'duration': 8.424}, {'end': 144.926, 'text': 'actually picked up a lot of pace,', 'start': 142.986, 'duration': 1.94}, {'end': 151.909, 'text': 'and after that what seemed like a dream when we saw it in movie has seemingly become a reality in the last couple of years.', 'start': 144.926, 'duration': 6.983}, {'end': 160.213, 'text': 'Now we have conversational AI boards who can take our request through speech recognition and perform several tasks.', 'start': 152.55, 'duration': 7.663}, {'end': 165.624, 'text': 'For instance, playing music, a movie recommendation,', 'start': 160.782, 'duration': 4.842}], 'summary': 'Ai and ml gained popularity, leading to breakthrough achievements and the development of conversational ai boards for tasks like music playing and movie recommendations.', 'duration': 38.659, 'max_score': 126.965, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ126965.jpg'}, {'end': 258.653, 'src': 'embed', 'start': 212.118, 'weight': 0, 'content': [{'end': 216.64, 'text': "I'm sure most of us know him actually have built a personalized home assistant.", 'start': 212.118, 'duration': 4.522}, {'end': 225.011, 'text': 'So he actually incorporated facial recognition as well from a Facebook that is a rather complex or advanced approach.', 'start': 217.364, 'duration': 7.647}, {'end': 233.679, 'text': 'I would say to use object detection and all for doing this Jarvis project, but it is a very advanced project that he had made,', 'start': 225.051, 'duration': 8.628}, {'end': 235.68, 'text': 'but in this session we will stick to the basics.', 'start': 233.679, 'duration': 2.001}, {'end': 239.944, 'text': "So let's go ahead and take a look at the project objective that we're going to follow guys.", 'start': 236.301, 'duration': 3.643}, {'end': 244.328, 'text': 'So the project objective is to build a speech triggered bot.', 'start': 240.745, 'duration': 3.583}, {'end': 252.411, 'text': 'So basically what will happen is it will recognize voice and automate several tasks for us now for the speech recognition.', 'start': 245.168, 'duration': 7.243}, {'end': 258.653, 'text': 'We will use the speech recognition library in Python and for the text to speech function of the board.', 'start': 252.631, 'duration': 6.022}], 'summary': 'Build a speech-triggered bot to recognize voice and automate tasks using python.', 'duration': 46.535, 'max_score': 212.118, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ212118.jpg'}], 'start': 84.053, 'title': 'Ai and speech recognition', 'summary': 'Covers the development of jarvis ai inspired by iron man, and the application of ai and ml in speech recognition and automation, with a focus on building a speech-triggered bot for task automation using python libraries.', 'chapters': [{'end': 125.664, 'start': 84.053, 'title': 'Developing jarvis ai', 'summary': "Discusses the inspiration for jarvis ai from the marvel cinematic universe's iron man, where the protagonist uses an ai-powered bot to perform tasks on voice commands, reflecting futuristic capabilities.", 'duration': 41.611, 'highlights': ["The protagonist in the Marvel Cinematic Universe's Iron Man uses an AI-powered bot called Jarvis to perform tasks on voice commands, showcasing futuristic capabilities.", 'The idea of Jarvis was inspired by a fictional movie of the Marvel Cinematic Universe, where the protagonist has an AI-powered bot capable of performing advanced tasks.']}, {'end': 294.892, 'start': 126.965, 'title': 'Ai speech recognition and automation', 'summary': "Discusses the rapid development and application of ai and ml in speech recognition and automation, highlighting instances such as conversational ai boards and mark zuckerberg's personalized home assistant, and outlines the project objective of building a speech-triggered bot for automating tasks using python libraries.", 'duration': 167.927, 'highlights': ['The rapid development and application of AI and ML in speech recognition and automation has gained popularity and curiosity among developers and organizations, leading to breakthrough achievements. The development and application of AI and ML in speech recognition and automation has gained popularity and curiosity among developers and organizations, with breakthrough achievements being witnessed.', 'The emergence of conversational AI boards capable of performing various tasks through speech recognition, such as playing music, providing movie recommendations, and retrieving weather reports, showcases the advancement in AI technology. Conversational AI boards can perform tasks through speech recognition, including playing music, providing movie recommendations, and retrieving weather reports, showcasing the advancement in AI technology.', "Mark Zuckerberg's development of a personalized home assistant incorporating facial recognition demonstrates an advanced approach utilizing object detection for the Jarvis project. Mark Zuckerberg developed a personalized home assistant incorporating facial recognition, demonstrating an advanced approach utilizing object detection for the Jarvis project.", 'The project objective is to build a speech-triggered bot using Python libraries for speech recognition and text-to-speech functions, with tasks including web browser automation, playing music, obtaining movie recommendations, retrieving weather reports, and translations. The project objective is to build a speech-triggered bot using Python libraries for speech recognition and text-to-speech functions, with tasks including web browser automation, playing music, obtaining movie recommendations, retrieving weather reports, and translations.']}], 'duration': 210.839, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ84053.jpg', 'highlights': ['The project objective is to build a speech-triggered bot using Python libraries for speech recognition and text-to-speech functions, with tasks including web browser automation, playing music, obtaining movie recommendations, retrieving weather reports, and translations.', 'The emergence of conversational AI boards capable of performing various tasks through speech recognition, such as playing music, providing movie recommendations, and retrieving weather reports, showcases the advancement in AI technology.', 'Mark Zuckerberg developed a personalized home assistant incorporating facial recognition, demonstrating an advanced approach utilizing object detection for the Jarvis project.', 'The rapid development and application of AI and ML in speech recognition and automation has gained popularity and curiosity among developers and organizations, leading to breakthrough achievements.', "The protagonist in the Marvel Cinematic Universe's Iron Man uses an AI-powered bot called Jarvis to perform tasks on voice commands, showcasing futuristic capabilities."]}, {'end': 544.427, 'segs': [{'end': 332.095, 'src': 'heatmap', 'start': 295.273, 'weight': 0.937, 'content': [{'end': 300.846, 'text': 'So let me just take you to pie charm guys and I have written down all this code already.', 'start': 295.273, 'duration': 5.573}, {'end': 304.347, 'text': "So I'll just I just remove this and begin from the beginning.", 'start': 300.866, 'duration': 3.481}, {'end': 310.91, 'text': "So what you first have to do is we will create anaconda environment for our project since I've already done it.", 'start': 304.867, 'duration': 6.043}, {'end': 312.11, 'text': "I'll just show you what it is.", 'start': 310.93, 'duration': 1.18}, {'end': 319.573, 'text': "So if you don't have anaconda install you can follow our tutorial which is how to use anaconda for python.", 'start': 312.81, 'duration': 6.763}, {'end': 325.329, 'text': 'and then you can choose the interpreter over here, which is Django project.', 'start': 320.584, 'duration': 4.745}, {'end': 332.095, 'text': "I have used this saw, which is an anaconda environment, and after that you're going to need a few dependencies for speech recognition,", 'start': 325.389, 'duration': 6.706}], 'summary': 'Creating an anaconda environment for a django project and adding dependencies for speech recognition.', 'duration': 36.822, 'max_score': 295.273, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ295273.jpg'}, {'end': 402.03, 'src': 'embed', 'start': 332.095, 'weight': 0, 'content': [{'end': 339.462, 'text': "because to record the speech from microphone you're going to need by audio module from python and,", 'start': 332.095, 'duration': 7.367}, {'end': 344.407, 'text': 'while installing or importing the pi audio module in pi charm, using the python interpreter.', 'start': 339.462, 'duration': 4.945}, {'end': 347.968, 'text': 'What happens is you get a lot of errors on Windows.', 'start': 345.067, 'duration': 2.901}, {'end': 353.47, 'text': "So that is the reason why I've chosen anaconda for this project, and simply you can just type in the command,", 'start': 348.428, 'duration': 5.042}, {'end': 359.793, 'text': 'which is pip install by audio and pip install speech recognition in your terminal, or you can add it from here as well.', 'start': 353.47, 'duration': 6.323}, {'end': 364.815, 'text': "If you don't find it, you can just use the pitcon in the terminal over here right?", 'start': 359.833, 'duration': 4.982}, {'end': 378.245, 'text': "So now what I'll do is I just use this speech recognition library to get speech or record speech from my microphone and the recorded as text,", 'start': 365.715, 'duration': 12.53}, {'end': 380.647, 'text': 'so we can follow the official documentation as well.', 'start': 378.245, 'duration': 2.402}, {'end': 383.909, 'text': 'I mean to see how we can install and import speech recognition in python.', 'start': 380.667, 'duration': 3.242}, {'end': 387.592, 'text': "Okay, I'll just show you what the official documentation looks like.", 'start': 383.929, 'duration': 3.663}, {'end': 390.334, 'text': 'So this is the speech recognition of pi pi.', 'start': 387.632, 'duration': 2.702}, {'end': 392.556, 'text': 'This is the project pi pi.org.', 'start': 390.795, 'duration': 1.761}, {'end': 399.141, 'text': 'inside this you can find a lot of details, like you have library for performing speech recognition with support for several,', 'start': 392.556, 'duration': 6.585}, {'end': 402.03, 'text': 'and API is online and offline as well.', 'start': 399.628, 'duration': 2.402}], 'summary': 'Using anaconda, pip install pyaudio and speech recognition to record speech from microphone in python.', 'duration': 69.935, 'max_score': 332.095, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ332095.jpg'}, {'end': 510.008, 'src': 'embed', 'start': 424.825, 'weight': 4, 'content': [{'end': 426.346, 'text': 'So first of all, you have to import.', 'start': 424.825, 'duration': 1.521}, {'end': 438.929, 'text': "I'll import it as SR because it's easy guys and the next thing that you have to do is make instance using the recognizer class.", 'start': 430.447, 'duration': 8.482}, {'end': 442.63, 'text': "So I'll just talk about the recognizer class first of all.", 'start': 439.389, 'duration': 3.241}, {'end': 448.432, 'text': 'so we actually need a recognizer class which is basically used to create an instance, to get the speech from a source.', 'start': 442.63, 'duration': 5.802}, {'end': 451.333, 'text': 'I mean our case that source is going to be microphone.', 'start': 448.792, 'duration': 2.541}, {'end': 458.375, 'text': "So let's just create an instance call it our so SR dot recognizer.", 'start': 451.993, 'duration': 6.382}, {'end': 459.555, 'text': 'This is how I initialize it.', 'start': 458.415, 'duration': 1.14}, {'end': 471.322, 'text': "and now what I do is I'll take one command so with SR dot microphone as our source.", 'start': 460.339, 'duration': 10.983}, {'end': 483.126, 'text': "This is the source that we're going to use and we will use this to get the speech guys and to get the speech we will use the SR dot listen.", 'start': 471.683, 'duration': 11.443}, {'end': 484.647, 'text': 'Oh, wait a second guys.', 'start': 483.586, 'duration': 1.061}, {'end': 494.878, 'text': "And now what we'll do is let's just say We will use the R that we have created and listen to the source guys.", 'start': 485.827, 'duration': 9.051}, {'end': 503.924, 'text': 'So this will be our text and after this we have to use the Google Cloud API to recognize the text guys.', 'start': 495.419, 'duration': 8.505}, {'end': 510.008, 'text': 'I mean to put the audio inside that class and this is where pi audio is actually used.', 'start': 504.444, 'duration': 5.564}], 'summary': 'Initializing and using speech recognition with python, using sr and google cloud api.', 'duration': 85.183, 'max_score': 424.825, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ424825.jpg'}], 'start': 295.273, 'title': 'Speech recognition with python', 'summary': 'Covers setting up a speech recognition project using anaconda environment, installing necessary dependencies like pyaudio and speechrecognition, and utilizing the google cloud api to recognize speech.', 'chapters': [{'end': 424.765, 'start': 295.273, 'title': 'Setting up speech recognition in python', 'summary': "Covers setting up a speech recognition project using anaconda environment, installing necessary dependencies like pyaudio and speechrecognition, and using the official documentation for python's speechrecognition library.", 'duration': 129.492, 'highlights': ["Anaconda environment is used for the project, eliminating potential errors on Windows when installing PiAudio module, and SpeechRecognition is installed via 'pip' command in terminal or PyCharm.", "Setting up a speech recognition project involves creating an Anaconda environment, installing necessary dependencies like PyAudio and SpeechRecognition, and using the official documentation for Python's SpeechRecognition library.", "The official documentation for Python's SpeechRecognition library provides detailed information, including libraries for performing speech recognition with support for several APIs, including Google speech recognition API."]}, {'end': 544.427, 'start': 424.825, 'title': 'Speech recognition using python', 'summary': 'Explains the process of speech recognition using the python library, covering the steps of importing the library, creating an instance of the recognizer class, using the microphone as the source, and utilizing the google cloud api to recognize speech.', 'duration': 119.602, 'highlights': ['The process starts with importing the necessary library and creating an instance of the recognizer class.', 'Using the microphone as the source, the speech is obtained using the SR.listen method.', 'The recognized speech is then processed using the Google Cloud API to convert the audio into text.']}], 'duration': 249.154, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ295273.jpg', 'highlights': ['Utilize Anaconda environment to avoid errors on Windows when installing PiAudio module', "Install SpeechRecognition via 'pip' command in terminal or PyCharm", 'Create Anaconda environment, install PyAudio and SpeechRecognition, refer to official documentation', "Official documentation for Python's SpeechRecognition library provides detailed information and support for several APIs", 'Import necessary library, create instance of recognizer class', 'Obtain speech using microphone as source and SR.listen method', 'Process recognized speech using Google Cloud API to convert audio into text']}, {'end': 988.536, 'segs': [{'end': 621.649, 'src': 'heatmap', 'start': 583.133, 'weight': 1, 'content': [{'end': 584.754, 'text': "It's our daughter.", 'start': 583.133, 'duration': 1.621}, {'end': 594.8, 'text': "Request arrow as E and for now, we'll just leave it empty to just run the program guys.", 'start': 586.735, 'duration': 8.065}, {'end': 597.181, 'text': 'All right.', 'start': 596.821, 'duration': 0.36}, {'end': 604.975, 'text': 'Okay. So, as you can see, guys, I said okay and it took my or speech, recognized my speech,', 'start': 597.201, 'duration': 7.774}, {'end': 609.779, 'text': 'converted it into text and gave me a print statement with the text over here', 'start': 604.975, 'duration': 4.804}, {'end': 613.542, 'text': 'So this is how easily you can create speech to text guys.', 'start': 610.379, 'duration': 3.163}, {'end': 614.443, 'text': 'All right.', 'start': 614.183, 'duration': 0.26}, {'end': 621.649, 'text': 'So for now, our purpose to use this speech to text is basically, you know, when I say something to the board,', 'start': 614.903, 'duration': 6.746}], 'summary': 'Demonstration of speech-to-text program with easy conversion of speech to text.', 'duration': 38.516, 'max_score': 583.133, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ583133.jpg'}, {'end': 707.125, 'src': 'embed', 'start': 676.644, 'weight': 2, 'content': [{'end': 679.026, 'text': 'All right, so this is one whole sentence guys.', 'start': 676.644, 'duration': 2.382}, {'end': 681.007, 'text': "I'm sure okay now.", 'start': 679.046, 'duration': 1.961}, {'end': 682.368, 'text': 'I think you could see this now.', 'start': 681.107, 'duration': 1.261}, {'end': 687.274, 'text': 'So this is the whole sentence it actually recorded from a speech.', 'start': 682.768, 'duration': 4.506}, {'end': 692.296, 'text': "recognize my voice in plain, simple English, and let's just try and ask a few other things.", 'start': 687.274, 'duration': 5.022}, {'end': 694.458, 'text': "So let's speak in Spanish for all.", 'start': 693.017, 'duration': 1.441}, {'end': 700.521, 'text': 'hola como estas? Right?', 'start': 694.458, 'duration': 6.063}, {'end': 707.125, 'text': "So it is a recognizing Spanish as well, and let's talk gibberish, guys.", 'start': 700.641, 'duration': 6.484}], 'summary': 'Speech recognition in english and spanish, also recognizes gibberish.', 'duration': 30.481, 'max_score': 676.644, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ676644.jpg'}, {'end': 750.876, 'src': 'embed', 'start': 725.238, 'weight': 0, 'content': [{'end': 731.74, 'text': 'So this is all about the speech recognition guys, in which we are talking something and the program is actually recognizing our speech,', 'start': 725.238, 'duration': 6.502}, {'end': 733.04, 'text': 'converting it into text.', 'start': 731.74, 'duration': 1.3}, {'end': 738.102, 'text': 'So the next part in our session is text to speech guys.', 'start': 733.88, 'duration': 4.222}, {'end': 746.752, 'text': 'So before that, let me just give you a preview of what it actually looks like so for text-to-speech this part of the session.', 'start': 738.883, 'duration': 7.869}, {'end': 750.876, 'text': "I'm going to use the pipe TTS x3 library and again,", 'start': 746.852, 'duration': 4.024}], 'summary': 'The session covers speech recognition and text-to-speech using the pipe tts x3 library.', 'duration': 25.638, 'max_score': 725.238, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ725238.jpg'}], 'start': 545.368, 'title': 'Speech recognition and text-to-speech', 'summary': "Covers the implementation of speech recognition and text-to-speech using python, demonstrating how to convert speech to text and vice versa, as well as adjusting properties such as voice and volume. it also discusses error handling and language recognition, showcasing the application's versatility in recognizing various languages and gibberish.", 'chapters': [{'end': 988.536, 'start': 545.368, 'title': 'Speech recognition and text-to-speech', 'summary': "Covers the implementation of speech recognition and text-to-speech using python, demonstrating how to convert speech to text and vice versa, as well as adjusting properties such as voice and volume. it also discusses error handling and language recognition, showcasing the application's versatility in recognizing various languages and gibberish.", 'duration': 443.168, 'highlights': ["The program recognizes speech and converts it into text, enabling automated functions based on recognized input, such as playing music or performing tasks. The speech recognition functionality allows for the conversion of spoken words into text, facilitating the automation of tasks based on the recognized input. For instance, the program can interpret commands like 'play music' to trigger specific actions.", "Demonstrates the ability to recognize languages like English, Spanish, and gibberish, showcasing the program's versatility in language recognition. The speech recognition system exhibits the capability to understand multiple languages, including English and Spanish, as well as gibberish, highlighting its versatility in language recognition and processing.", 'Utilizes the pi TTS x3 library for text-to-speech functionality, offering features like adjusting voice properties and volume control. The text-to-speech functionality is implemented using the pi TTS x3 library, providing the ability to modify voice properties and adjust volume levels, allowing for customization of the speech output.']}], 'duration': 443.168, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ545368.jpg', 'highlights': ['The program recognizes speech and converts it into text, enabling automated functions based on recognized input.', 'Utilizes the pi TTS x3 library for text-to-speech functionality, offering features like adjusting voice properties and volume control.', "Demonstrates the ability to recognize languages like English, Spanish, and gibberish, showcasing the program's versatility in language recognition."]}, {'end': 1488.304, 'segs': [{'end': 1095.892, 'src': 'embed', 'start': 1050.91, 'weight': 4, 'content': [{'end': 1051.25, 'text': 'All right.', 'start': 1050.91, 'duration': 0.34}, {'end': 1052.67, 'text': 'So we are getting the text here.', 'start': 1051.27, 'duration': 1.4}, {'end': 1064.272, 'text': "What now we'll do is we'll use the engine dot say method or we can add this before the text.", 'start': 1055.731, 'duration': 8.541}, {'end': 1078.56, 'text': "So engine dot say will copy the same code again, right? So now what I'll do is I'll copy this over here.", 'start': 1065.072, 'duration': 13.488}, {'end': 1085.745, 'text': 'All right, and after this I will call engine dot run and wait.', 'start': 1080.021, 'duration': 5.724}, {'end': 1087.906, 'text': "So let's see how the output looks like guys.", 'start': 1086.425, 'duration': 1.481}, {'end': 1095.892, 'text': 'Hello, how are you doing? I am fine.', 'start': 1092.87, 'duration': 3.022}], 'summary': 'Using engine dot say method to generate output text.', 'duration': 44.982, 'max_score': 1050.91, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ1050910.jpg'}, {'end': 1186.937, 'src': 'embed', 'start': 1143.419, 'weight': 3, 'content': [{'end': 1151.345, 'text': "All right, so after that it's taking our voice and recognizing it and converting it into speech and after that, let's just call it again guys.", 'start': 1143.419, 'duration': 7.926}, {'end': 1162.814, 'text': "So, after getting this whole thing converted in text, let's just get another.", 'start': 1153.367, 'duration': 9.447}, {'end': 1165.116, 'text': 'what saying?', 'start': 1162.814, 'duration': 2.302}, {'end': 1173.125, 'text': 'something like what do you want me to do right?', 'start': 1165.116, 'duration': 8.009}, {'end': 1181.112, 'text': "and let's just run and wait right, and after this, let's copy this whole code again.", 'start': 1173.125, 'duration': 7.987}, {'end': 1186.937, 'text': 'Paste it over here.', 'start': 1186.036, 'duration': 0.901}], 'summary': 'Voice recognition and speech conversion demonstrated with code repetition.', 'duration': 43.518, 'max_score': 1143.419, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ1143419.jpg'}, {'end': 1260.808, 'src': 'embed', 'start': 1228.353, 'weight': 2, 'content': [{'end': 1231.034, 'text': 'because when I did it earlier it was working fine.', 'start': 1228.353, 'duration': 2.681}, {'end': 1233.315, 'text': "So let's just run it again.", 'start': 1231.755, 'duration': 1.56}, {'end': 1239.038, 'text': 'Hello How are you doing? I am doing good.', 'start': 1233.335, 'duration': 5.703}, {'end': 1243.601, 'text': 'What do you want me to do? I want you to listen.', 'start': 1240.619, 'duration': 2.982}, {'end': 1248.463, 'text': 'See this is how you can actually work with it.', 'start': 1246.142, 'duration': 2.321}, {'end': 1258.286, 'text': 'So, now that this is a conversation going on, guys, after that, what I can do here is add a few more stuff, like you know if block,', 'start': 1249.243, 'duration': 9.043}, {'end': 1260.808, 'text': 'and after which I can just give it a command.', 'start': 1258.286, 'duration': 2.522}], 'summary': 'Testing conversation with code example and commands.', 'duration': 32.455, 'max_score': 1228.353, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ1228353.jpg'}, {'end': 1347.488, 'src': 'embed', 'start': 1284.445, 'weight': 1, 'content': [{'end': 1290.059, 'text': 'So what this program would do is Take that cue from the part where music is said.', 'start': 1284.445, 'duration': 5.614}, {'end': 1295.804, 'text': 'it will eventually call that function which is going to call the function to play the music,', 'start': 1290.059, 'duration': 5.745}, {'end': 1299.086, 'text': 'and that is how we are going to automate these tasks all together.', 'start': 1295.804, 'duration': 3.282}, {'end': 1311.437, 'text': "All right guys, so let's try to see how we can incorporate different methods guys try this project file inside which I have web automation.", 'start': 1300.828, 'duration': 10.609}, {'end': 1314.559, 'text': 'So I have just made these are dummy.', 'start': 1311.497, 'duration': 3.062}, {'end': 1322.453, 'text': 'What do you call it? These are dummy functions that I will see if it actually works or if I can call these functions inside my program.', 'start': 1314.899, 'duration': 7.554}, {'end': 1330.639, 'text': "So from web automation I'm going to import everything right?", 'start': 1323.073, 'duration': 7.566}, {'end': 1347.488, 'text': "So now what I'll do is I'll use this recognized text and let's say if recognized Text is equal to, let's say music right or Let's say play music.", 'start': 1331.239, 'duration': 16.249}], 'summary': 'Automate tasks by calling functions for web automation and play music.', 'duration': 63.043, 'max_score': 1284.445, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ1284445.jpg'}, {'end': 1482.099, 'src': 'embed', 'start': 1450.073, 'weight': 0, 'content': [{'end': 1453.895, 'text': 'it should come back to the original function or it would just end the process.', 'start': 1450.073, 'duration': 3.822}, {'end': 1457.196, 'text': "All right, and we'll have to add triggers as well.", 'start': 1454.015, 'duration': 3.181}, {'end': 1459.457, 'text': 'So that we are going to do in the next session guys.', 'start': 1457.497, 'duration': 1.96}, {'end': 1462.459, 'text': 'And if you have any questions, you can mention them in the comment section below.', 'start': 1459.718, 'duration': 2.741}, {'end': 1468.968, 'text': "and now that we have come to the end of this session is, I'm sure most of you must having a lot of questions for me,", 'start': 1463.243, 'duration': 5.725}, {'end': 1470.689, 'text': 'so you can mention them in the comment section below.', 'start': 1468.968, 'duration': 1.721}, {'end': 1472.771, 'text': 'And so, if you are new here,', 'start': 1470.849, 'duration': 1.922}, {'end': 1482.099, 'text': "don't forget to subscribe to a takeoff or more exciting tutorials and also press the bell icon to get the latest updates on any Raycon and do check out any records.", 'start': 1472.771, 'duration': 9.328}], 'summary': 'Upcoming session will cover adding triggers; invite questions. subscribe for updates.', 'duration': 32.026, 'max_score': 1450.073, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ1450073.jpg'}], 'start': 988.556, 'title': 'Speech conversion and music playback', 'summary': 'Covers converting speech to text, text to speech, automating music playback, and handling errors in the program. it also discusses speech recognition, web automation, and the development of functions for music commands, emphasizing the need for triggers and separate blocks to avoid overlapping processes.', 'chapters': [{'end': 1260.808, 'start': 988.556, 'title': 'Text to speech conversion and conversation', 'summary': 'Demonstrates the process of converting speech to text and text to speech, incorporating both modules into a conversation, and handling errors in the program.', 'duration': 272.252, 'highlights': ['The process of converting speech to text and text to speech is demonstrated, involving initializing the engine and using the say method and run and wait function.', 'The program encounters errors in recognizing and converting speech, which are identified and rectified by modifying the code.', 'Incorporating if blocks into the conversation process is mentioned as a potential addition to the program.']}, {'end': 1488.304, 'start': 1261.308, 'title': 'Automating music playback with speech recognition', 'summary': 'Demonstrates the process of automating music playback using speech recognition and web automation, highlighting the development of functions for recognizing and executing music commands, as well as the need for triggers and separate blocks to avoid overlapping processes.', 'duration': 226.996, 'highlights': ["Functions are developed to recognize and execute music commands using speech and web automation. The program demonstrates the development of functions to recognize speech commands such as 'play music' and execute them using web automation, showcasing the integration of speech recognition and automation.", 'Importing web automation module and utilizing recognized text to call specific functions. The transcript showcases the process of importing a web automation module and using recognized text to call specific functions, exemplifying the practical application of speech recognition and automation in executing commands.', 'Need for triggers and separate blocks to avoid overlapping processes and manage task execution. The need for implementing triggers and separate blocks to manage task execution and prevent overlapping processes is emphasized, illustrating the importance of structuring the automation process to ensure efficient and organized task management.']}], 'duration': 499.748, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/0sjRkz1UIDQ/pics/0sjRkz1UIDQ988556.jpg', 'highlights': ['The need for triggers and separate blocks to avoid overlapping processes and manage task execution.', 'Functions are developed to recognize and execute music commands using speech and web automation.', 'Incorporating if blocks into the conversation process is mentioned as a potential addition to the program.', 'The program encounters errors in recognizing and converting speech, which are identified and rectified by modifying the code.', 'The process of converting speech to text and text to speech is demonstrated, involving initializing the engine and using the say method and run and wait function.', 'Importing web automation module and utilizing recognized text to call specific functions.']}], 'highlights': ["The protagonist in the Marvel Cinematic Universe's Iron Man uses an AI-powered bot called Jarvis to perform tasks on voice commands, showcasing futuristic capabilities.", 'The emergence of conversational AI boards capable of performing various tasks through speech recognition, such as playing music, providing movie recommendations, and retrieving weather reports, showcases the advancement in AI technology.', 'Mark Zuckerberg developed a personalized home assistant incorporating facial recognition, demonstrating an advanced approach utilizing object detection for the Jarvis project.', 'The program recognizes speech and converts it into text, enabling automated functions based on recognized input.', 'Utilizes the pi TTS x3 library for text-to-speech functionality, offering features like adjusting voice properties and volume control.', 'The need for triggers and separate blocks to avoid overlapping processes and manage task execution.', 'Functions are developed to recognize and execute music commands using speech and web automation.', 'The process of converting speech to text and text to speech is demonstrated, involving initializing the engine and using the say method and run and wait function.', 'The rapid development and application of AI and ML in speech recognition and automation has gained popularity and curiosity among developers and organizations, leading to breakthrough achievements.', 'The project objective is to build a speech-triggered bot using Python libraries for speech recognition and text-to-speech functions, with tasks including web browser automation, playing music, obtaining movie recommendations, retrieving weather reports, and translations.']}