title
How I use SQL as a Data Analyst

description
SQL for Data Science πŸ‘‰πŸΌ https://lukeb.co/SQLdataScience Coursera Plus (7-day free trial) πŸ‘‰πŸΌ https://lukeb.co/CourseraPlus Data Nerds!! Let's discuss how SQL is used in Data Science. For this, I cover the basics of how to use SQL; Additionally, we talk about popular database options (MySQL, PostgreSQL, SQLite, SQL Server, Oracle) and editors to use (VS Code, DBeaver, Access). Certificates & Courses ================================== Coursera Courses: πŸ“œ Google Data Analytics Certificate (START HERE) πŸ‘‰πŸΌ https://lukeb.co/GoogleCert πŸ’Ώ SQL for Data Science πŸ‘‰πŸΌ https://lukeb.co/SQLdataScience 🧾 Excel Skills for Business πŸ‘‰πŸΌΒ  https://lukeb.co/ExcelBusinessAnalyst 🐍 Python for Everybody πŸ‘‰πŸΌ https://lukeb.co/PythonForEverybody πŸ“Š Data Visualization with Tableau πŸ‘‰πŸΌΒ https://lukeb.co/Tableau_UCDavis πŸ΄β€β˜ οΈ Data Science: Foundations using R πŸ‘‰πŸΌ https://lukeb.co/RforDataScienceJH Coursera Plus Subscription (Monthly $59USD) πŸ‘‰πŸΌ https://lukeb.co/CourseraMonthly Coursera Plus Subscription (Annual $399USD) πŸ‘‰πŸΌ https://lukeb.co/CourseraYearly πŸ‘¨πŸΌβ€πŸ« All courses πŸ‘‰πŸΌ https://kit.co/lukebarousse/data-analytics-courses Build a Portfolio Online ================================== πŸ‘©πŸ»β€πŸ’»Build portfolio here πŸ‘‰πŸΌ http://hostinger.com/luke Rebate Code: "LUKE" My Portfolio πŸ‘‰πŸΌ https://lukebarousse.tech/ Books for Data Nerds ================================== πŸ“š Books I’ve read πŸ‘‰πŸΌ https://kit.co/lukebarousse/book-recommendations πŸ“— Data Science Must Read πŸ‘‰πŸΌ https://geni.us/StorytellingWithData πŸ“™ Tableau πŸ‘‰πŸΌ https://geni.us/tableau πŸ“˜ Power BIπŸ‘‰πŸΌ https://geni.us/powerbi πŸ“• Python πŸ‘‰πŸΌ https://geni.us/pythontricks Tech for Data Nerds ================================== βš™οΈ Tech I use πŸ‘‰πŸΌ https://kit.co/lukebarousse/computer-accessories πŸͺŸWindows on a Mac (Parallels VM) πŸ‘‰πŸΌ https://lukeb.co/ParallelsFreeTrial πŸ‘¨πŸΌβ€πŸ’» M1 Macbook Air (Mac of choice) πŸ‘‰πŸΌ https://geni.us/M1macAir8GB πŸ’» Dell XPS 13 (PC of choice) πŸ‘‰πŸΌ https://geni.us/DellNewXPS13 πŸ’» Asus Vivo Book (Lowest Cost PC) πŸ‘‰πŸΌ https://geni.us/AsusVivoBook15 πŸ’»Lenovo IdeaPad (Best Value PC)πŸ‘‰πŸΌ https://geni.us/LenovoIdeaPad15 Social Media / Contact Me ====================== πŸ™‹πŸΌβ€β™‚οΈNewsletter: https://www.lukebarousse.com/ πŸŒ„ Instagram: https://www.instagram.com/lukebarousse/ ⏰ TikTok: https://www.tiktok.com/@lukebarousse πŸ“˜ Facebook: https://www.facebook.com/datavizbyluke πŸ“₯ Business Inquiries: luke@lukebarousse.com 00:00 Intro 00:44 Types of analysis w/ SQL 02:34 What is SQL? 03:14 How to query w/ SQL? 05:25 What database to use? 06:12 NoSQL vs SQL 07:09 SQL Course Recommendation 08:16 Where to keep database? 09:27 Cloud database options 10:14 SQL editors 11: 57 How I use SQL! 14:16 Suprise ending As a member of the Amazon, Coursera, Hostinger, Parallels, and Interview Query Affiliate Programs, I earn a commission from qualifying purchases on the links above. It costs you nothing but helps me with content creation. #datanerd #dataanalyst #datascience

detail
{'title': 'How I use SQL as a Data Analyst', 'heatmap': [{'end': 180.552, 'start': 154.407, 'weight': 0.721}, {'end': 326.051, 'start': 283.916, 'weight': 0.898}, {'end': 421.47, 'start': 407.601, 'weight': 0.705}, {'end': 514.065, 'start': 444.235, 'weight': 0.833}, {'value': 0.8256315817938444, 'end_time': 514.0649999999999, 'start_time': 465.922}, {'end': 578.458, 'start': 555.007, 'weight': 0.715}, {'end': 688.825, 'start': 607.177, 'weight': 0.858}, {'end': 765.332, 'start': 738.731, 'weight': 0.817}], 'summary': 'Covers the significance of sql in data science, explores database options and syntax, database management options, and emphasizes the power of sql in data analysis, using a few hundred lines of sql to clean and query a large dataset.', 'chapters': [{'end': 339.476, 'segs': [{'end': 28.494, 'src': 'embed', 'start': 0.049, 'weight': 0, 'content': [{'end': 3.773, 'text': 'Every day we generate 2.5 quintillion bytes of data.', 'start': 0.049, 'duration': 3.724}, {'end': 9.479, 'text': "And that's the equivalent of every person in the world filling up an entire Excel file with data every day.", 'start': 3.833, 'duration': 5.646}, {'end': 10.68, 'text': "And that's where SQL comes in.", 'start': 9.619, 'duration': 1.061}, {'end': 14.183, 'text': "Developers use this tool to store a lot of the data that we're generating.", 'start': 10.76, 'duration': 3.423}, {'end': 20.89, 'text': 'And then data analysts, data scientists, and even data engineers go in and use SQL in order to access and analyze this data.', 'start': 14.323, 'duration': 6.567}, {'end': 25.212, 'text': "So let's dive in today to better understand how I use SQL for my job as a data analyst.", 'start': 20.99, 'duration': 4.222}, {'end': 28.494, 'text': 'what is SQL and its importance in working with other data science tools?', 'start': 25.212, 'duration': 3.282}], 'summary': '2.5 quintillion bytes of data generated daily, sql used to store and analyze it.', 'duration': 28.445, 'max_score': 0.049, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs49.jpg'}, {'end': 66.971, 'src': 'embed', 'start': 38.838, 'weight': 1, 'content': [{'end': 43.921, 'text': 'And without a doubt, SQL is the most important tool by those that work in the field of data science.', 'start': 38.838, 'duration': 5.083}, {'end': 50.024, 'text': "What better way to show this than by sharing how I use SQL in my job? And I find that it's used in two main ways.", 'start': 44.221, 'duration': 5.803}, {'end': 53.425, 'text': 'First is ad hoc analysis, and the second is data sharing.', 'start': 50.264, 'duration': 3.161}, {'end': 58.007, 'text': 'Ad hoc analysis of business data is typically performed to answer one-off questions.', 'start': 53.645, 'duration': 4.362}, {'end': 63.129, 'text': "For example, let's say, my coworker comes up to me and asks the following hey, What were sales last quarter?", 'start': 58.187, 'duration': 4.942}, {'end': 66.971, 'text': 'From here I can use SQL to dive into the data and investigate further.', 'start': 63.41, 'duration': 3.561}], 'summary': 'Sql is crucial in data science for ad hoc analysis and data sharing.', 'duration': 28.133, 'max_score': 38.838, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs38838.jpg'}, {'end': 100.93, 'src': 'embed', 'start': 75.154, 'weight': 3, 'content': [{'end': 81.116, 'text': 'Can you look into why this is? Now, I could potentially stay in SQL and do this type of analysis.', 'start': 75.154, 'duration': 5.962}, {'end': 84.797, 'text': "But I found I'm actually more comfortable using programming languages for this.", 'start': 81.496, 'duration': 3.301}, {'end': 89.12, 'text': 'So I could use something like Python or R to dive further and investigate the data.', 'start': 85.257, 'duration': 3.863}, {'end': 91.562, 'text': 'So moving on to that second aspect of data sharing.', 'start': 89.361, 'duration': 2.201}, {'end': 95.726, 'text': 'What happens now whenever you have a stakeholder that wants to dive into the data themselves?', 'start': 91.763, 'duration': 3.963}, {'end': 100.93, 'text': "I do expect data nerds to have the skill of SQL, but non-data nerds I don't expect this as much.", 'start': 96.166, 'duration': 4.764}], 'summary': 'Preference for using programming languages like python or r for data analysis over sql due to greater comfort. non-data nerds may not have sql skills.', 'duration': 25.776, 'max_score': 75.154, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs75154.jpg'}, {'end': 154.407, 'src': 'embed', 'start': 123.418, 'weight': 2, 'content': [{'end': 127.86, 'text': 'Do you mind building a dashboard for the entire sales team so that way everybody can monitor this?', 'start': 123.418, 'duration': 4.442}, {'end': 129.08, 'text': 'Once again,', 'start': 128.38, 'duration': 0.7}, {'end': 137.891, 'text': 'SQL can be used in visualization tools such as Power BI and Tableau to pull the data real time and display it in a dashboard for my coworkers to admire.', 'start': 129.08, 'duration': 8.811}, {'end': 142.696, 'text': 'And this is why SQL is, in my opinion, the most powerful tool to know as a data analyst.', 'start': 138.151, 'duration': 4.545}, {'end': 146.76, 'text': 'not only can i use this to access and analyze data,', 'start': 143.417, 'duration': 3.343}, {'end': 154.407, 'text': 'i can take it a step further and put it into other data analytical tools to thus extract the data and provide it to my stakeholders.', 'start': 146.76, 'duration': 7.647}], 'summary': 'Sql enables real-time data visualization in tools like power bi and tableau, making it essential for data analysts.', 'duration': 30.989, 'max_score': 123.418, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs123418.jpg'}, {'end': 180.552, 'src': 'heatmap', 'start': 154.407, 'weight': 0.721, 'content': [{'end': 161.413, 'text': "so now let's dive deeper into understanding what sql actually is, and this stands for structured query language,", 'start': 154.407, 'duration': 7.006}, {'end': 171.725, 'text': "and it's most commonly pronounced sql, sql, sql, sql, sql, sql, SQL is the language used to query a database.", 'start': 161.413, 'duration': 10.312}, {'end': 176.248, 'text': 'Basically think of it like a common language between your computer and a database.', 'start': 172.105, 'duration': 4.143}, {'end': 180.552, 'text': 'So I frequently get asked how do I install SQL on my computer?', 'start': 176.529, 'duration': 4.023}], 'summary': 'Sql, or structured query language, is the common language used to query a database.', 'duration': 26.145, 'max_score': 154.407, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs154407.jpg'}, {'end': 326.051, 'src': 'heatmap', 'start': 283.916, 'weight': 0.898, 'content': [{'end': 291.239, 'text': 'such in this case where we can add on a where statement to filter our data further, based on how many users commented more than 100 times.', 'start': 283.916, 'duration': 7.323}, {'end': 296.201, 'text': 'We can also use SQL to insert or store more information in a database.', 'start': 292.079, 'duration': 4.122}, {'end': 304.324, 'text': 'For this, we can identify the table and columns we want to insert into and then provide the values to insert into this table.', 'start': 296.361, 'duration': 7.963}, {'end': 306.544, 'text': 'Once I run this, the table will be updated.', 'start': 304.544, 'duration': 2}, {'end': 309.626, 'text': 'And then finally, we can use SQL to update record entries.', 'start': 306.685, 'duration': 2.941}, {'end': 312.087, 'text': "For this, let's say we have a new comment on one of my videos.", 'start': 309.786, 'duration': 2.301}, {'end': 315.308, 'text': 'Once again, we can identify the table we want to update,', 'start': 312.347, 'duration': 2.961}, {'end': 323.811, 'text': 'the column we want to set a new value for and then a where statement to select what to filter on for that entry.', 'start': 315.888, 'duration': 7.923}, {'end': 326.051, 'text': 'Once I run this, the table will be updated.', 'start': 324.051, 'duration': 2}], 'summary': 'Using sql to filter, insert, and update data in database tables.', 'duration': 42.135, 'max_score': 283.916, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs283916.jpg'}], 'start': 0.049, 'title': "Sql's role in data science", 'summary': 'Emphasizes the significance of sql in data science, covering its role in data storage, analysis, and visualization, and explaining the basics of sql and databases, including its functions in data analysis and ad hoc analysis.', 'chapters': [{'end': 142.696, 'start': 0.049, 'title': 'Importance of sql in data science', 'summary': 'Highlights the significance of sql in data science, emphasizing its role in data storage, analysis, and visualization, while showcasing its relevance through examples of ad hoc analysis, data sharing, and utilization in visualization tools.', 'duration': 142.647, 'highlights': ['SQL is crucial in storing and analyzing the 2.5 quintillion bytes of data generated daily, serving as a primary tool for developers, data analysts, scientists, and engineers. It facilitates ad hoc analysis and data sharing, enabling the investigation of one-off questions and live connections to databases for non-technical stakeholders.', "The speaker demonstrates the use of programming languages like Python or R for in-depth analysis beyond ad hoc queries, showcasing SQL's complementarity with other data science tools.", "SQL's versatility extends to visualization tools like Power BI and Tableau, enabling real-time data pulling and display in dashboards for broader accessibility and monitoring, underscoring its pivotal role in data visualization within data science.", 'The chapter emphasizes SQL as the most important tool for individuals in the field of data science, highlighting its significant impact on tasks such as ad hoc analysis, data sharing, and data visualization, reaffirming its relevance and indispensability for data analysts.']}, {'end': 339.476, 'start': 143.417, 'title': 'Introduction to sql and database', 'summary': 'Explains the basics of sql and databases, emphasizing the importance of sql as a language for querying databases, the components required to run sql, the structure and syntax of sql queries, and various functions of sql in data analysis, including filtering, inserting, and updating data.', 'duration': 196.059, 'highlights': ['SQL is the language used to query a database, serving as a common language between computers and databases. ', 'The components required to run SQL are an editor and a database, with databases being organized collections of data used by large companies to store vast amounts of information. ', 'The structure and syntax of SQL to query a database is simple, involving select and from statements to identify the column of interest and the table to connect to, respectively. ', 'SQL can be used to analyze data, filter data based on specific criteria, insert or store information in a database, and update record entries. ', 'A survey by Stack Overflow conducted last year revealed the software tools used for jobs by respondents. ']}], 'duration': 339.427, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs49.jpg', 'highlights': ['SQL is crucial in storing and analyzing the 2.5 quintillion bytes of data generated daily, serving as a primary tool for developers, data analysts, scientists, and engineers.', 'The chapter emphasizes SQL as the most important tool for individuals in the field of data science, highlighting its significant impact on tasks such as ad hoc analysis, data sharing, and data visualization, reaffirming its relevance and indispensability for data analysts.', "SQL's versatility extends to visualization tools like Power BI and Tableau, enabling real-time data pulling and display in dashboards for broader accessibility and monitoring, underscoring its pivotal role in data visualization within data science.", "The speaker demonstrates the use of programming languages like Python or R for in-depth analysis beyond ad hoc queries, showcasing SQL's complementarity with other data science tools."]}, {'end': 505.238, 'segs': [{'end': 380.874, 'src': 'embed', 'start': 356.502, 'weight': 0, 'content': [{'end': 363.005, 'text': "I'd focus on the relational databases as they typically store data in a tabular form, which is more common for our job.", 'start': 356.502, 'duration': 6.503}, {'end': 364.886, 'text': 'But what about those NoSQL databases?', 'start': 363.145, 'duration': 1.741}, {'end': 373.67, 'text': 'So NoSQL actually stands for Not Only SQL, meaning that you can not only store this data in a tabular form.', 'start': 365.206, 'duration': 8.464}, {'end': 376.771, 'text': 'it also has a manner for you to store it in an unstructured manner.', 'start': 373.67, 'duration': 3.101}, {'end': 380.874, 'text': 'which is actually more efficient at handling very large amounts of data.', 'start': 377.131, 'duration': 3.743}], 'summary': 'Relational databases store tabular data, while nosql can handle unstructured data, more efficient for large data.', 'duration': 24.372, 'max_score': 356.502, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs356502.jpg'}, {'end': 426.051, 'src': 'heatmap', 'start': 392.805, 'weight': 1, 'content': [{'end': 395.588, 'text': "Well, quite honestly, I don't think it matters,", 'start': 392.805, 'duration': 2.783}, {'end': 401.354, 'text': 'as all of these relational databases use the same SQL syntax and you can transfer your skills between them.', 'start': 395.588, 'duration': 5.766}, {'end': 407.321, 'text': 'But interesting enough, the most popular options on that Stack Overflow list are also free and open source.', 'start': 401.595, 'duration': 5.726}, {'end': 409.403, 'text': 'So I would say start with any one of these.', 'start': 407.601, 'duration': 1.802}, {'end': 413.768, 'text': 'Personally, I use Postgres and also SQLite for all my consulting projects.', 'start': 409.703, 'duration': 4.065}, {'end': 421.47, 'text': 'other options are closed source and not always free, although they are very popular, especially being used in large corporations.', 'start': 414.128, 'duration': 7.342}, {'end': 426.051, 'text': "of these options, i've used primarily sql server for all of my work projects.", 'start': 421.47, 'duration': 4.581}], 'summary': 'Relational databases use same sql syntax; popular options are free and open source, like postgres and sqlite.', 'duration': 33.246, 'max_score': 392.805, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs392805.jpg'}, {'end': 475.565, 'src': 'embed', 'start': 449.157, 'weight': 2, 'content': [{'end': 453.658, 'text': 'which is perfect for those that have no prior experience with coding, that want to master SQL.', 'start': 449.157, 'duration': 4.501}, {'end': 458.5, 'text': "I even think it's great for those that have already obtained, maybe, the Google Data Analytics certificate,", 'start': 453.738, 'duration': 4.762}, {'end': 463.921, 'text': 'because now that you have these basics of SQL, you can use this specialization to dive further into it.', 'start': 458.5, 'duration': 5.421}, {'end': 465.922, 'text': 'The course focuses on SQLite,', 'start': 464.021, 'duration': 1.901}, {'end': 475.565, 'text': "and what I like about the platform that Coursera provides is that they get you up and running with the same tools they'll be using in real life without having to install any software.", 'start': 465.922, 'duration': 9.643}], 'summary': 'A course focused on sqlite, beneficial for beginners and google data analytics certificate holders to master sql, using real-life tools without installation.', 'duration': 26.408, 'max_score': 449.157, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs449157.jpg'}], 'start': 339.656, 'title': 'Database options & sql syntax', 'summary': 'Explores relational and non-relational databases, ease of switching between sql databases, free and open source options, and recommends the sql for data science specialization on coursera for mastering sql, particularly for those with no prior coding experience.', 'chapters': [{'end': 505.238, 'start': 339.656, 'title': 'Database options & sql syntax', 'summary': 'Discusses the popularity of relational and non-relational databases, the ease of switching between sql databases, the free and open source options, and recommends the sql for data science specialization on coursera for mastering sql, especially for those with no prior coding experience.', 'duration': 165.582, 'highlights': ['The most popular database options include relational databases (SQL) and non-relational databases (NoSQL), with NoSQL being more efficient at handling large amounts of data. Emphasizes the popularity of relational and non-relational databases and the efficiency of NoSQL for handling large data sets.', 'Relational databases use the same SQL syntax, making it easy to transfer skills between them, and the most popular options on Stack Overflow list are free and open source. Highlights the ease of transferring skills between relational databases and the availability of free and open source options.', 'Recommends the SQL for Data Science specialization on Coursera for mastering SQL, especially for those with no prior coding experience, and mentions the practical aspects of the specialization such as capstone projects and real-life tools. Recommends the SQL for Data Science specialization on Coursera and emphasizes its practical aspects for those with no coding experience.']}], 'duration': 165.582, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs339656.jpg', 'highlights': ['Emphasizes the popularity of relational and non-relational databases and the efficiency of NoSQL for handling large data sets.', 'Highlights the ease of transferring skills between relational databases and the availability of free and open source options.', 'Recommends the SQL for Data Science specialization on Coursera and emphasizes its practical aspects for those with no coding experience.']}, {'end': 712.981, 'segs': [{'end': 549.765, 'src': 'embed', 'start': 524.052, 'weight': 0, 'content': [{'end': 529.956, 'text': 'So in that case, I can download a copy of that database and have it running locally on my computer.', 'start': 524.052, 'duration': 5.904}, {'end': 532.558, 'text': 'I do this most frequently with Postgres databases.', 'start': 530.116, 'duration': 2.442}, {'end': 538.322, 'text': 'So because of this, I have the Postgres application installed on my computer so I can run these databases locally.', 'start': 532.738, 'duration': 5.584}, {'end': 540.923, 'text': 'This will be the same for any other popular database.', 'start': 538.422, 'duration': 2.501}, {'end': 545.564, 'text': "Now running a database locally on a computer is great option if you're the only person accessing it.", 'start': 541.163, 'duration': 4.401}, {'end': 549.765, 'text': 'Once you get into other people accessing it, you need to look at other options.', 'start': 546.024, 'duration': 3.741}], 'summary': 'Frequently downloads and runs postgres databases locally on computer, suitable for single user access.', 'duration': 25.713, 'max_score': 524.052, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs524052.jpg'}, {'end': 584.562, 'src': 'heatmap', 'start': 555.007, 'weight': 0.715, 'content': [{'end': 559.308, 'text': 'For the server option, many large companies have their own where they like to store their own data.', 'start': 555.007, 'duration': 4.301}, {'end': 565.63, 'text': 'Typically your IT department will handle installing and setting up all the dependencies for the database and then grant you access.', 'start': 559.548, 'duration': 6.082}, {'end': 567.771, 'text': 'This option is called on-prem.', 'start': 566.13, 'duration': 1.641}, {'end': 571.654, 'text': 'Now, another popular option is to use a cloud provider instead.', 'start': 567.991, 'duration': 3.663}, {'end': 578.458, 'text': 'Looking back at the Stack Overflow poll, the most popular options are from Amazon, Google, Microsoft, and even Heroku.', 'start': 571.934, 'duration': 6.524}, {'end': 584.562, 'text': "Personally, I've used Google Cloud Platform in the past, but many other options are just as good, especially AWS.", 'start': 578.758, 'duration': 5.804}], 'summary': 'Large companies prefer on-prem servers, while popular cloud providers include amazon, google, microsoft, and heroku, with aws being particularly popular.', 'duration': 29.555, 'max_score': 555.007, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs555007.jpg'}, {'end': 695.009, 'src': 'heatmap', 'start': 599.032, 'weight': 1, 'content': [{'end': 606.897, 'text': 'However, it is important to know that these cloud providers provide some sort of interface for you to run SQL queries for your databases.', 'start': 599.032, 'duration': 7.865}, {'end': 615.203, 'text': 'As an example, in the Google Data Analytics certificate, students use BigQuery inside of the Google Cloud Platform to run their SQL queries.', 'start': 607.177, 'duration': 8.026}, {'end': 619.786, 'text': 'And this actually leads into the next area of where you should be writing and running your SQL queries.', 'start': 615.443, 'duration': 4.343}, {'end': 625.268, 'text': 'Well, you could use a cloud platform in order to run your SQLs on those databases in the cloud.', 'start': 619.966, 'duration': 5.302}, {'end': 629.309, 'text': "But what happens if you have a different environment or your database isn't in the cloud?", 'start': 625.348, 'duration': 3.961}, {'end': 634.53, 'text': 'All the major relational databases actually have their own editors for them.', 'start': 629.549, 'duration': 4.981}, {'end': 639.692, 'text': "I've used Postgres before, and for this, I've used their SQL management software of pgAdmin.", 'start': 634.73, 'duration': 4.962}, {'end': 644.353, 'text': 'MySQL also has a similar software called MySQL Workbench to run their databases.', 'start': 640.012, 'duration': 4.341}, {'end': 649.834, 'text': "And then from my work with SQL Server, I've even used Microsoft's SQL Server Management Studio.", 'start': 644.633, 'duration': 5.201}, {'end': 653.575, 'text': 'One note is that this tool is specifically only available for Windows users.', 'start': 650.154, 'duration': 3.421}, {'end': 659.596, 'text': "But what happens if you're like me and you're always switching between different databases? For this, I like to use VS Code.", 'start': 653.895, 'duration': 5.701}, {'end': 663.737, 'text': 'VS Code has many free extensions in order to support your database of choice.', 'start': 659.736, 'duration': 4.001}, {'end': 671.399, 'text': "I'm currently checking out this extension called MySQL, which contrary to its name, actually supports a lot of the different popular databases.", 'start': 664.477, 'duration': 6.922}, {'end': 673.92, 'text': 'Another popular option to check out is dBeaver.', 'start': 671.779, 'duration': 2.141}, {'end': 679.042, 'text': "So I haven't used this one personally, but it does seem like a good option as recommended by other data analysts.", 'start': 674.06, 'duration': 4.982}, {'end': 682.963, 'text': "Just like VS Code, it's free and supports a variety of different databases.", 'start': 679.382, 'duration': 3.581}, {'end': 688.825, 'text': "Now there is one option that I don't suggest learning for those new to data analytics, and that is Microsoft Access.", 'start': 683.263, 'duration': 5.562}, {'end': 695.009, 'text': "It's sort of a weird in-between of a relational database and also a graphical user interface that you can build apps.", 'start': 689.105, 'duration': 5.904}], 'summary': 'Cloud providers offer interfaces for running sql queries. also, there are various tools like bigquery, pgadmin, mysql workbench, sql server management studio, vs code, dbeaver, and microsoft access for running and managing databases.', 'duration': 95.977, 'max_score': 599.032, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs599032.jpg'}, {'end': 671.399, 'src': 'embed', 'start': 640.012, 'weight': 2, 'content': [{'end': 644.353, 'text': 'MySQL also has a similar software called MySQL Workbench to run their databases.', 'start': 640.012, 'duration': 4.341}, {'end': 649.834, 'text': "And then from my work with SQL Server, I've even used Microsoft's SQL Server Management Studio.", 'start': 644.633, 'duration': 5.201}, {'end': 653.575, 'text': 'One note is that this tool is specifically only available for Windows users.', 'start': 650.154, 'duration': 3.421}, {'end': 659.596, 'text': "But what happens if you're like me and you're always switching between different databases? For this, I like to use VS Code.", 'start': 653.895, 'duration': 5.701}, {'end': 663.737, 'text': 'VS Code has many free extensions in order to support your database of choice.', 'start': 659.736, 'duration': 4.001}, {'end': 671.399, 'text': "I'm currently checking out this extension called MySQL, which contrary to its name, actually supports a lot of the different popular databases.", 'start': 664.477, 'duration': 6.922}], 'summary': 'Vs code supports various databases with free extensions, including mysql.', 'duration': 31.387, 'max_score': 640.012, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs640012.jpg'}, {'end': 710.28, 'src': 'embed', 'start': 679.382, 'weight': 3, 'content': [{'end': 682.963, 'text': "Just like VS Code, it's free and supports a variety of different databases.", 'start': 679.382, 'duration': 3.581}, {'end': 688.825, 'text': "Now there is one option that I don't suggest learning for those new to data analytics, and that is Microsoft Access.", 'start': 683.263, 'duration': 5.562}, {'end': 695.009, 'text': "It's sort of a weird in-between of a relational database and also a graphical user interface that you can build apps.", 'start': 689.105, 'duration': 5.904}, {'end': 701.974, 'text': 'We used this app while I was in the Navy in order to store our data and the data nerds that worked with this were not really a fan of it.', 'start': 695.349, 'duration': 6.625}, {'end': 710.28, 'text': "Now, the reason why I'm recommending not learning this as a new data analyst is because I feel Microsoft is actually signaling that they're going to be shutting this product down.", 'start': 702.134, 'duration': 8.146}], 'summary': 'Vs code, free, supports various databases. avoid learning microsoft access due to potential shutdown.', 'duration': 30.898, 'max_score': 679.382, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs679382.jpg'}], 'start': 505.499, 'title': 'Database management options', 'summary': "Covers various database management options, such as local installation, cloud providers like amazon, google, microsoft, and heroku, and tools like pgadmin, mysql workbench, microsoft's sql server management studio, vs code, and dbeaver.", 'chapters': [{'end': 712.981, 'start': 505.499, 'title': 'Options for running and managing databases', 'summary': "Discusses options for running and managing databases, including local installation, cloud providers like amazon, google, microsoft, and heroku, and tools like pgadmin, mysql workbench, microsoft's sql server management studio, vs code, and dbeaver.", 'duration': 207.482, 'highlights': ['Running a database locally on a computer is a great option, especially for learning or new to SQL, and popular databases like Postgres have their own management software like pgAdmin. (Relevance score: 5)', 'Cloud providers like Amazon, Google, Microsoft, and Heroku are popular options for running databases, and Google Cloud Platform provides BigQuery for running SQL queries. (Relevance score: 4)', "Tools like pgAdmin, MySQL Workbench, Microsoft's SQL Server Management Studio, VS Code, and dBeaver are available for running and managing different popular databases. (Relevance score: 3)", 'Microsoft Access is not recommended for new data analysts due to its likely shutdown, and it is an in-between of a relational database and a graphical user interface for building apps. (Relevance score: 2)']}], 'duration': 207.482, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs505499.jpg', 'highlights': ['Running a database locally on a computer is a great option, especially for learning or new to SQL, and popular databases like Postgres have their own management software like pgAdmin.', 'Cloud providers like Amazon, Google, Microsoft, and Heroku are popular options for running databases, and Google Cloud Platform provides BigQuery for running SQL queries.', "Tools like pgAdmin, MySQL Workbench, Microsoft's SQL Server Management Studio, VS Code, and dBeaver are available for running and managing different popular databases.", 'Microsoft Access is not recommended for new data analysts due to its likely shutdown, and it is an in-between of a relational database and a graphical user interface for building apps.']}, {'end': 924.006, 'segs': [{'end': 773.415, 'src': 'heatmap', 'start': 730.869, 'weight': 0, 'content': [{'end': 737.211, 'text': 'The problem is that the data was spread out across many tables within our database and required extensive cleaning.', 'start': 730.869, 'duration': 6.342}, {'end': 738.571, 'text': 'This was a large set of data.', 'start': 737.331, 'duration': 1.24}, {'end': 745.635, 'text': 'So a teammate and I worked together in order to build a SQL query to complying all these different tables that we needed and clean up the data.', 'start': 738.731, 'duration': 6.904}, {'end': 752.582, 'text': 'This portion of the project ended up taking a few weeks for us to actually develop these few hundred lines of SQL in order to query this data.', 'start': 745.755, 'duration': 6.827}, {'end': 759.628, 'text': 'Now that we had this cleaned up data set, my boss now was like hey, I need this data set in a more presentable form,', 'start': 752.862, 'duration': 6.766}, {'end': 762.591, 'text': 'so that way my teammates can use it and also her boss could access it.', 'start': 759.628, 'duration': 2.963}, {'end': 765.332, 'text': "So that's where something like Power BI comes in.", 'start': 762.771, 'duration': 2.561}, {'end': 773.415, 'text': 'In this case, I can use the tool within it of Power Query, which is also in Excel, to actually go in and execute the SQL code we had written.', 'start': 765.732, 'duration': 7.683}], 'summary': 'Cleaned and queried large dataset using sql, took weeks to develop. used power bi for presenting data.', 'duration': 42.546, 'max_score': 730.869, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs730869.jpg'}, {'end': 805.373, 'src': 'embed', 'start': 781.639, 'weight': 1, 'content': [{'end': 789.502, 'text': 'From here, I developed an end to end solution that allowed the stakeholders to now go to a dashboard and access the data that they were curious about.', 'start': 781.639, 'duration': 7.863}, {'end': 793.765, 'text': 'Now to be clear, this SQL query functionality is not specific just to Power BI.', 'start': 789.682, 'duration': 4.083}, {'end': 798.408, 'text': 'Other popular tools like Tableau and Google Data Studio also have this functionality.', 'start': 793.905, 'duration': 4.503}, {'end': 805.373, 'text': "Now, not related to this project, but I've also used other tools such as programming languages to utilize SQL as well.", 'start': 798.468, 'duration': 6.905}], 'summary': 'Developed end-to-end solution for stakeholders to access data via dashboard, applicable to power bi, tableau, and google data studio.', 'duration': 23.734, 'max_score': 781.639, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs781639.jpg'}, {'end': 856.7, 'src': 'embed', 'start': 818.041, 'weight': 3, 'content': [{'end': 823.607, 'text': "And then from there use Python's tools and libraries such as pandas to clean up the data,", 'start': 818.041, 'duration': 5.566}, {'end': 828.633, 'text': 'maybe matplotlib to plot it and even maybe something like sklearn to perform some regression analysis.', 'start': 823.607, 'duration': 5.026}, {'end': 833.138, 'text': 'Now all these things with Python, you can also do with other popular languages such as R also.', 'start': 829.033, 'duration': 4.105}, {'end': 836.702, 'text': 'So I share all these examples, not for you to think that you have to learn all these different tools.', 'start': 833.298, 'duration': 3.404}, {'end': 840.205, 'text': 'But instead, I want you to understand the power of SQL.', 'start': 837.222, 'duration': 2.983}, {'end': 846.35, 'text': "It's such a powerful language and has such a powerful use that other tools are taking advantage of it.", 'start': 840.625, 'duration': 5.725}, {'end': 851.695, 'text': 'Because of this, I feel that SQL is the most powerful tool to learn as a data analyst.', 'start': 846.491, 'duration': 5.204}, {'end': 855.078, 'text': 'As always, if you got value out of this video, smash that like button.', 'start': 852.336, 'duration': 2.742}, {'end': 856.7, 'text': "And with that, I'll see you in the next one.", 'start': 855.259, 'duration': 1.441}], 'summary': "Python, sql, and other tools used for data analysis, emphasizing sql's power and importance for data analysts.", 'duration': 38.659, 'max_score': 818.041, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs818041.jpg'}], 'start': 712.981, 'title': 'The power of sql in data analysis', 'summary': 'Emphasizes the use of sql in a data analysis project, involving the use of a few hundred lines of sql to clean and query a large dataset. it underscores the importance of sql as a powerful tool for data analysts.', 'chapters': [{'end': 924.006, 'start': 712.981, 'title': 'Power of sql in data analysis', 'summary': 'Highlights the use of sql in a data analysis project, where a few hundred lines of sql were used to clean and query a large dataset, and the importance of sql as a powerful tool for data analysts.', 'duration': 211.025, 'highlights': ['SQL query took a few weeks to develop for cleaning and querying a large dataset A few weeks were spent developing a few hundred lines of SQL to clean and query a large dataset.', "Power BI used to make the data presentable and accessible to stakeholders Power BI, including the Power Query tool, was used to create a presentable and real-time connected dashboard for stakeholders' access.", 'SQL functionality not specific to Power BI but also available in other popular tools like Tableau and Google Data Studio The SQL query functionality is not limited to Power BI and is also available in other popular tools like Tableau and Google Data Studio.', 'Python used to import and clean data, and perform regression analysis Python was used to import and clean data, as well as to perform regression analysis, demonstrating its versatility in data analysis.', 'Importance of SQL as a powerful tool for data analysts The speaker emphasizes the power and importance of SQL as a tool for data analysts, advocating its significance in the field.']}], 'duration': 211.025, 'thumbnail': 'https://coursnap.oss-ap-southeast-1.aliyuncs.com/video-capture/GEBzsz8ZSXs/pics/GEBzsz8ZSXs712981.jpg', 'highlights': ['A few weeks were spent developing a few hundred lines of SQL to clean and query a large dataset.', "Power BI, including the Power Query tool, was used to create a presentable and real-time connected dashboard for stakeholders' access.", 'The SQL query functionality is not limited to Power BI and is also available in other popular tools like Tableau and Google Data Studio.', 'Python was used to import and clean data, as well as to perform regression analysis, demonstrating its versatility in data analysis.', 'The speaker emphasizes the power and importance of SQL as a tool for data analysts, advocating its significance in the field.']}], 'highlights': ['SQL is crucial in storing and analyzing the 2.5 quintillion bytes of data generated daily, serving as a primary tool for developers, data analysts, scientists, and engineers.', 'The chapter emphasizes SQL as the most important tool for individuals in the field of data science, highlighting its significant impact on tasks such as ad hoc analysis, data sharing, and data visualization, reaffirming its relevance and indispensability for data analysts.', 'The speaker emphasizes the power and importance of SQL as a tool for data analysts, advocating its significance in the field.', "SQL's versatility extends to visualization tools like Power BI and Tableau, enabling real-time data pulling and display in dashboards for broader accessibility and monitoring, underscoring its pivotal role in data visualization within data science.", 'Emphasizes the popularity of relational and non-relational databases and the efficiency of NoSQL for handling large data sets.', 'Highlights the ease of transferring skills between relational databases and the availability of free and open source options.', 'Recommends the SQL for Data Science specialization on Coursera and emphasizes its practical aspects for those with no coding experience.', 'Running a database locally on a computer is a great option, especially for learning or new to SQL, and popular databases like Postgres have their own management software like pgAdmin.', 'Cloud providers like Amazon, Google, Microsoft, and Heroku are popular options for running databases, and Google Cloud Platform provides BigQuery for running SQL queries.', "Tools like pgAdmin, MySQL Workbench, Microsoft's SQL Server Management Studio, VS Code, and dBeaver are available for running and managing different popular databases.", 'Microsoft Access is not recommended for new data analysts due to its likely shutdown, and it is an in-between of a relational database and a graphical user interface for building apps.', 'A few weeks were spent developing a few hundred lines of SQL to clean and query a large dataset.', "Power BI, including the Power Query tool, was used to create a presentable and real-time connected dashboard for stakeholders' access.", 'The SQL query functionality is not limited to Power BI and is also available in other popular tools like Tableau and Google Data Studio.', 'Python was used to import and clean data, as well as to perform regression analysis, demonstrating its versatility in data analysis.']}