[Get it solved] Demonstrate that you used “continue” and “break” ...

Check Out Our Work & Get Yours Done

Submit Work

Download Sample

Enroll in the complete course for only $250 USD*

Order Now

Submit work Offers

Demonstrate that you used “continue” and “break” to improve efficiency and explain why

data mining

Description

In this module, you will create 3 Python notebooks. In all your programs, use markdown cells and be creative of summarizing/commenting your notebooks. You will also add detailed comments in your Python code (using “#” or triple quote signs)

Notebook 1: Prime Numbers

1. Write a Jupyter Notebook to find the 9991th to 10000th prime numbers.

2. Display the numbers to the notebook

3. Output the numbers to a data file called prime.txt.

4. Demonstrate that you used “continue” and “break” to improve efficiency and explain why

Notebook 2: ROT

ROT is a very simple cipher that is used to information hiding (https://en.wikipedia.org/wiki/ROT13 (Links to an external site.)) . Read the wiki page to understand how the encoder works.

a. Create an “encode_rot()” function to encode any given strings using ROT algorithm. The input should contain a key and a string of text. The key can be any integers both negative and positive (-12: turn left 12 positions, 36: turn right 36 positions). Only alphabet letters are encoded.

The following two lines of your code will generate an output of “Ocejkpg ECP ngctp 2 !!!“.

clear_text=” Machine CAN learn 2 !!!”

encode_rot(clear_text, 28)

b. Create a decode_rot() to decode a ciphertext. The input only contains the ciphertext. The output contains the cleartext and the key that was used to encode text. The key will be between 0 and 25. (hint: Compare your decoded clear text with a dictionary text file and decide which one has the most dictionary words.)

The following two lines will generate an output of

The clear text is : “Data is like people, interrogate it hard enough and it will tell you whatever you want to hear.”
The key is 16

cipher_text= “Tqjq yi byau fuefbu, ydjuhhewqju yj xqht udekwx qdt yj mybb jubb oek mxqjuluh oek mqdj je xuqh.”

decode_rot(cipher_text)

c. Use your function to test more ciphertexts and show the results in your notebook . (you can use www.rot13.com (Links to an external site.) or your encode_rot() function generate cipher texts).

Notebook 3. Histogram of Top Words

Find the top frequently used words in the book of “Sense and Sensibility”. The book is in the sense_andsensibility.txt file.

1. The words should not be case sensitive, meaning “Mother” and “mother” are considered the same word.

2. Replace all the punctuation marks with a space.

3. Use the “stopwords.txt” file to remove all the stop words in text. (Do NOT modify the stopwords.txt file)

4. Create a histogram similar to the “histogram.jpg” file. The diagram should contain the ranking, the top 30 words, the number of times they appeared in the book. The number of stars will be the number of appearance divided by 10. For example, “mother” appears 263 times; there are 26 stars displayed. (You may not have the exactly the same result as in the histogram.jpg)

Submissions:

You will export your notebooks in both .html and .py formats. You will submit the following 6 files to Blackboard.

1. Firstname_Lastname_Notebook1.html

2. Firstname_Lastname_Notebook1.py

3. Firstname_Lastname_Notebook2.html

4. Firstname_Lastname_Notebook2.py

5. Firstname_Lastname_Notebook3.html

6. Firstname_Lastname_Notebook3.py

Related Questions in data mining category

Data mining in the Bank sector " research paper Title, I want to request writing data mining Techniques to add in the paper with references and useful explanation.

Data Collection Methods (graded) The choice of the data collection method depends on the specific information needed to answer the research question and the resources available to the researcher.

Implement L2 regularized linear regression algorithm with λ ranging from 0 to 150 (integers only). For each of the 6 dataset, plot both the training set MSE and the test set MSE as a function of λ (x-axis) in one graph.

Design Python code for text pre-processing (a) Parsing and tokenizing - read files from RCV1v2, find the documentID and record it to a collection of BowDocument Objects.

Using the data from Titanic with decision-tree mode, and cluster model to analyze the relationship between variables and survival rate in the Titanic event.

There is a separate workbook (Excel file) for each part of the exam. The core data is the same in each workbook.

Project Select a problem that can be solved using a database approach. This may be a real problem, or you may use ideas from the text or class to create your own problem to solve. You must use Access 2010 or Access 2013 to create the forms and reports.

XYZ Companies uses Data Mining

Data Vizualization Fundamentals: Downtown Office supply is a small family owned store in a small suburb.

Automatic Cluster Detection, please organize a short report organized

Get Higher Grades Now

Tutors Online

Description

Drop Files Here Or Click to Upload

Get Free Quote!

282 Experts Online

Connect With Us

We Provide Services Across The Globe

Disclaimer: The reference papers or solutions provided by Calltutors.com serve as model papers or solutions for students or professionals and are not to be submitted as it is to any institutions. These documents are intended to be used for research and reference purposes only. University and company's logo's are the property of respected owners. We don't have affiliation with the mentioned universities. By using our services means, you agree to our Honor Code , Privacy Policy , Terms & Conditions , Payment , Refund & Cancellation Policy.

Enroll in the complete course for only $250 USD*

Demonstrate that you used “continue” and “break” to improve efficiency and explain why

data mining

Description

Get instant assignment help service

Related Questions in data mining category

Policy

Exploring

Other

Connect With Us

We Provide Services Across The Globe