Posts

Recognizing Handwritten Digits with scikit-learn

Image
  Recognizing handwritten text is a problem that can be traced back to the first automatic machines that needed to recognize individual characters in handwritten documents. Think about, for example, the ZIP codes on letters at the post office and the automation needed to recognize these five digits. Perfect recognition of these codes is necessary in order to sort mail automatically and efficiently. Included among the other applications that may come to mind is OCR (Optical Character Recognition) software. OCR software must read the handwritten text, or pages of printed books, for general electronic documents in which each character is well defined Hypothesis : The Digits data set of the Scikit-learn library provides numerous data sets that are useful for testing many problems of data analysis and prediction of the results. Some Scientist claims that it predicts the digit accurately 95% of the time. Perform data analysis to accept or reject this Hypothesis. Step 1: Dataset  In ...

Performing Analysis of Meteorological Data

Image
In this blog, we will discuss a data analysis which is based on the following dataset. “Has the Apparent temperature and humidity compared monthly across 10 years of the data indicate an increase due to Global warming”  following is the Hypothesis for the analysis? The Hypothesis means we need to find whether the average Apparent temperature for the month of a month says April starting from 2006 to 2016 and whether the average humidity for the same period has increased or not. This monthly analysis has to be done for all 12 months over the 10-year period. So you are basically resampling your data from hourly to monthly, then comparing the same month over the 10-year period. Support your analysis by appropriate visualizations using matplotlib and/or seaborn library. Step 1: Importing Libraries  Step 2: Importing Dataset Step 3: Dataset Description Step 4: Data Cleaning We have to find the null values in the dataset In this step we will prepare our data for the pl...