By Antonio Gulli

ISBN-10: 1517216710

ISBN-13: 9781517216719

Big Data and machine learning in Python and Spark

**Example text**

What is an odds ratio? Solution: Given a probability $p$, the odds ratio simply identifies the chances for the event to happen over the chances for the event not to happen. The odds ratio is a function of $p$: $\text{odds}(p) = \frac{p}{1-p}$. If we take the logarithm, we have $\text{logit}(p) = \log\frac{p}{1-p}$, and we can thus map the range of probabilities $[0, 1]$ into the full range of real numbers. The concept of “odds ratio” is also useful to introduce “logistic regression” (the topic of the next question). Let us assume that we have a linear equation $z = w^T x$. We can imagine that $z$ is represented in terms of log odds, so that $\log\frac{p}{1-p} = z$, which can be solved for $p$ as $p = \frac{1}{1+e^{-z}}$, and the problem becomes the one of finding the weights $w$ with the lowest error over all training examples.
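The relationship between probability, odds, and log odds described above can be checked numerically. The following is a minimal sketch using only the Python standard library; the function names `odds`, `logit`, and `sigmoid`, and the symbols `p` (a probability) and `z` (the value of the linear equation), are chosen here for illustration and are not taken from the book.

```python
import math


def odds(p: float) -> float:
    """Chances of the event happening over the chances of it not happening.

    Assumes 0 <= p < 1 (p = 1 would divide by zero).
    """
    return p / (1.0 - p)


def logit(p: float) -> float:
    """Log odds: maps a probability in (0, 1) onto the real line."""
    return math.log(odds(p))


def sigmoid(z: float) -> float:
    """Inverse of logit: solves log(p / (1 - p)) = z for p."""
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    # Round trip: probability -> log odds -> probability.
    for p in (0.1, 0.5, 0.9):
        z = logit(p)
        print(f"p={p:.1f}  odds={odds(p):.3f}  "
              f"log-odds={z:+.3f}  back to p={sigmoid(z):.3f}")
```

In logistic regression, `z` would be produced by the linear model, and `sigmoid(z)` would be read as the predicted probability of the event.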

Can you read JSON into Python Pandas? Solution, Code

20. Can you draw a function in Python? Solution, Code
21. Can you represent a graph in Python? Solution, Code
22. What is an IPython notebook? Solution, Code
23. What is a convenient tool for performing data statistics? Solution, Code
24. How is it convenient to visualize data statistics? Solution, Code
25. How to compute covariance and correlation matrices with pandas? Solution, Code
26. Can you provide an example of connecting to the Twitter API? Solution, Code
27. Can you provide an example of connecting to the LinkedIn API? Solution, Code
28. Can you provide an example of connecting to the Facebook API? Solution, Code
29. What is TFxIDF? Solution, Code
30. What is “feature hashing”? And why is it useful for Big Data? Solution
31. What is “continuous feature binning”? Solution
32. What is normalization? Solution, Code
33. What is chi-square selection? Solution
34. What is mutual information and how can it be used for feature selection? Solution
35. What is a loss function, what are linear models, and what do we mean by regularization parameters in machine learning?

### A collection of Data Science Interview Questions Solved in Python and Spark: Hands-on Big Data and Machine Learning by Antonio Gulli

