bloominstituteoftechnology · timrocar · Sep 8, 2020 · Sep 9, 2020 · Sep 9, 2020 · Sep 10, 2020
diff --git a/.vscode/settings.json b/.vscode/settings.json
@@ -0,0 +1,3 @@
+{
+    "python.dataScience.jupyterServerURI": "local"
+}
diff --git a/Pipfile b/Pipfile
@@ -0,0 +1,13 @@
+[[source]]
+name = "pypi"
+url = "https://pypi.org/simple"
+verify_ssl = true
+
+[dev-packages]
+
+[packages]
+python-dotenv = "*"
+psycopg2-binary = "*"
+
+[requires]
+python_version = "3.8"
diff --git a/Pipfile.lock b/Pipfile.lock
diff --git a/Unit-3-Sprint-2-SQL-and-Databases-Study-Guide.md b/Unit-3-Sprint-2-SQL-and-Databases-Study-Guide.md
@@ -0,0 +1,118 @@
+# Unit 3 Sprint 2 SQL and Databases Study Guide
+
+This study guide should reinforce and provide practice for all of the concepts you have seen in the past week. There are a mix of written questions and coding exercises, both are equally important to prepare you for the sprint challenge as well as to be able to speak on these topics comfortably in interviews and on the job.
+
+If you get stuck or are unsure of something remember the 20 minute rule. If that doesn't help, then research a solution with [google](https://www.google.com) or [StackOverflow](https://www.stackoverflow.com). Only once you have exhausted these methods should you turn to your Team Lead - they won't be there on your SC or during an interview. That being said, don't hesitate to ask for help if you truly are stuck.
+
+Have fun studying!
+
+## SQL
+
+**Concepts:**
+
+1. What is SQL?
+2. What is a RDBMS?
+3. What is an ETL pipeline?
+4. What is a schema?
+5. What does each letter in ACID stand for? Give an explanation for each and why they matter?
+	- **A**
+	- **C**
+	- **I**
+	- **D**
+6. Explain each of the table relationships and give an example for each
+	- One-to-One
+	- One-to-Many
+	- Many-to-Many
+
+## Syntax
+For the following section, give a brief explanation of each of the SQL commands.
+
+1. **SELECT** - 
+2. **WHERE** - 
+3. **LIMIT** - 
+4. **ORDER** -
+5. **JOIN** -
+6. **CREATE TABLE** - 
+7. **INSERT** -
+8. **DISTINCT** -
+9. **GROUP BY** -
+10. **ORDER BY** -
+11. **AVG** - 
+12. **MAX** -
+13. **AS** -
+
+## Starting From Scratch
+Create a file named `study_part1.py` and complete the exercise below. The only library you should need to import is `sqlite3`. Don't forget to be PEP8 compliant!
+1. Create a new database file call `study_part1.sqlite3`
+2. Create a table with the following columns
+    ```
+    student - string
+    studied - string
+    grade - int
+    age - int
+    sex - string
+    ```
+
+3. Fill the table with the following data
+
+    ```
+    'Lion-O', 'True', 85, 24, 'Male'
+    'Cheetara', 'True', 95, 22, 'Female'
+    'Mumm-Ra', 'False', 65, 153, 'Male'
+    'Snarf', 'False', 70, 15, 'Male'
+    'Panthro', 'True', 80, 30, 'Male'
+    ```
+
+4. Save your data. You can check that everything is working so far if you can view the table and data in DBBrowser
+
+5. Write the following queries to check your work. Querie outputs should be formatted for readability, don't simply print a number to the screen with no explanation, add context.
+
+    ```
+    What is the average age? Expected Result - 48.8
+    What are the name of the female students? Expected Result - 'Cheetara'
+    How many students studied? Expected Results - 3
+    Return all students and all columns, sorted by student names in alphabetical order.
+    ```
+
+## Query All the Tables!
+
+### Setup
+Before we get started you'll need a few things.
+1. Download the [Chinook Database here](https://github.com/bundickm/Study-Guides/blob/master/data/Chinook_Sqlite.sqlite)
+2. The schema can be [found here](https://github.com/bundickm/Study-Guides/blob/master/data/Chinook%20Schema.png)
+3. Create a file named `study_part2.py` and complete the exercise below. The only library you should need to import is `sqlite3`. Don't forget to be PEP8 compliant!
+4. Add a connection to the chinook database so that you can answer the queries below.
+
+### Queries
+**Single Table Queries**
+1. Find the average invoice total for each customer, return the details for the first 5 ID's
+2. Return all columns in Customer for the first 5 customers residing in the United States
+3. Which employee does not report to anyone?
+4. Find the number of unique composers
+5. How many rows are in the Track table?
+
+**Joins**
+
+6. Get the name of all Black Sabbath tracks and the albums they came off of
+7. What is the most popular genre by number of tracks?
+8. Find all customers that have spent over $45
+9. Find the first and last name, title, and the number of customers each employee has helped. If the customer count is 0 for an employee, it doesn't need to be displayed. Order the employees from most to least customers.
+10. Return the first and last name of each employee and who they report to
+
+## NoSQL
+
+### Questions of Understanding
+
+1. What is a document store?
+
+2. What is a `key:value` pair? What data type in Python uses `key:value` pairs?
+
+3. Give an example of when it would be best to use a SQL Database and when it would be best to use a NoSQL Database
+
+4. What are some of the trade-offs between SQL and NoSQL?
+
+5. What does each letter in BASE stand for? Give an explanation for each and why they matter?
+    - B
+    - A
+    - S
+    - E
diff --git a/buddymove_holidayiq.py b/buddymove_holidayiq.py
diff --git a/module1-introduction-to-sql/Pipfile b/module1-introduction-to-sql/Pipfile
@@ -0,0 +1,11 @@
+[[source]]
+name = "pypi"
+url = "https://pypi.org/simple"
+verify_ssl = true
+
+[dev-packages]
+
+[packages]
+
+[requires]
+python_version = "3.8"
diff --git a/module1-introduction-to-sql/buddymove_holidayiq.py b/module1-introduction-to-sql/buddymove_holidayiq.py
@@ -0,0 +1,34 @@
+import sqlite3
+import pandas as pd
+
+conn = sqlite3.connect('buddymove_holidayiq.sqlite3')
+curs = conn.cursor()
+review = pd.read_csv('buddymove_holidayiq.csv')
+review.to_sql('review', con=conn, if_exists = 'replace')
+
+
+def execute_query(cursor, query):
+    cursor.execute(query)
+    result = cursor.fetchall()
+    print(result)
+
+
+ROW_COUNT= """ 
+SELECT COUNT(*) FROM review;
+"""
+
+
+print('Row Count:')
+execute_query(curs, ROW_COUNT)
+
+
+USER_COUNT = """
+SELECT COUNT(*)
+FROM review
+WHERE Nature >= 100
+AND Shopping >= 100;
+"""
+
+
+print('Users who love nature and shopping count:')
+execute_query(curs, USER_COUNT)
diff --git a/module1-introduction-to-sql/buddymove_holidayiq.sqlite3 b/module1-introduction-to-sql/buddymove_holidayiq.sqlite3
diff --git a/module1-introduction-to-sql/rpg_db_example.py b/module1-introduction-to-sql/rpg_db_example.py