Notebook archive
Loose pages, technical margins, and small notes gathered into a readable stack.
Methods used during EDA and when data is lacking. Lack of Training Data --- Data Augmentation: expand an input dataset by slightly changing the existing (orig
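The excerpt above is cut off, so the following is only a minimal sketch of the general idea, assuming small grayscale images stored as NumPy arrays with values in [0, 1]. The `augment` helper, the flip probability, and the noise level are illustrative assumptions, not taken from the original post.

```python
import numpy as np


def augment(image, rng, noise_std=0.05):
    """Return a slightly altered copy of `image` (hypothetical helper).

    Assumes `image` is a 2-D array with values in [0, 1].
    """
    out = image.copy()
    # Randomly mirror the image left-to-right half of the time.
    if rng.random() < 0.5:
        out = np.fliplr(out)
    # Add a small amount of Gaussian noise, then clip back into range.
    out = out + rng.normal(0.0, noise_std, size=out.shape)
    return np.clip(out, 0.0, 1.0)


rng = np.random.default_rng(0)
originals = rng.random((4, 8, 8))  # 4 synthetic 8x8 "images"

# One augmented copy per original doubles the dataset size.
augmented = np.stack([augment(img, rng) for img in originals])
expanded = np.concatenate([originals, augmented], axis=0)
print(expanded.shape)  # (8, 8, 8)
```

The labels of the augmented samples would normally be copied from their originals, since a flip or a little noise should not change the class.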
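Different data types in ML. Interval Data: numerical data with a meaningful order and equal spacing between values, but no true zero. Temperature, time, credit score, pH... (a zero value may exist, but it is an arbitrary point on the scale, not the absence of the quantity) Ratio Data T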
How to do mathematical calculations in Python. 1. Covariance: np.cov(x1, x2) 2. Correlation
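A short sketch of both calculations on made-up data. The excerpt names `np.cov` for covariance but does not show a call for correlation; `np.corrcoef` is used here as one common choice, which is an assumption about what the post intended.

```python
import numpy as np

# Made-up sample data: x2 is exactly twice x1.
x1 = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([2.0, 4.0, 6.0, 8.0, 10.0])

# np.cov returns the 2x2 covariance matrix; the off-diagonal
# entry is the (sample) covariance between x1 and x2.
cov_matrix = np.cov(x1, x2)
cov_x1_x2 = cov_matrix[0, 1]

# np.corrcoef returns the 2x2 Pearson correlation matrix.
corr_matrix = np.corrcoef(x1, x2)
corr_x1_x2 = corr_matrix[0, 1]

print(cov_x1_x2)
print(corr_x1_x2)
```

Note that `np.cov` divides by N - 1 by default (sample covariance); pass `ddof=0` for the population version.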
Applications in TensorFlow and PyTorch. TensorFlow 1. Convert a "pandas.DataFrame" to a "tf.data.Dataset". For the training dataset, you should set the "label" to be the prediction lab
How to plot charts for a dataset. Plotting --- In a competition, a dataset usually has many features, and some of them influence the target value more than others. So
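One way to see which features matter most is to plot each feature's absolute correlation with the target. The sketch below uses synthetic data, and the column names (`strong`, `weak`, `noise`, `target`) and output filename are made up for illustration. Correlation is only a rough, linear proxy for influence and may not be the chart the original post goes on to describe.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the script runs without a display

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 200

# Synthetic features with deliberately different influence on the target.
df = pd.DataFrame({
    "strong": rng.normal(size=n),
    "weak": rng.normal(size=n),
    "noise": rng.normal(size=n),
})
df["target"] = 3.0 * df["strong"] + 0.5 * df["weak"] + rng.normal(scale=0.5, size=n)

# Absolute Pearson correlation of each feature with the target, sorted ascending.
influence = df.corr()["target"].drop("target").abs().sort_values()

fig, ax = plt.subplots(figsize=(5, 3))
ax.barh(influence.index, influence.values)
ax.set_xlabel("|correlation with target|")
ax.set_title("Feature influence (synthetic data)")
fig.tight_layout()
fig.savefig("feature_influence.png")
```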
Applications in pandas. "pandas is kind of Excel in Python" 1. DataFrame 2. DataFrame to NumPy: df.to_numpy() 3. NumPy to DataFrame: df = pd.DataFrame(array) 4. Drop some columns: 5
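The first few items can be tried together on a toy frame. The excerpt does not show which call it uses to drop columns; `DataFrame.drop(columns=...)` below is one standard option, and the column names are made up.

```python
import numpy as np
import pandas as pd

# 1. Build a small DataFrame (toy data).
df = pd.DataFrame({"a": [1, 2, 3], "b": [4.0, 5.0, 6.0], "c": ["x", "y", "z"]})

# 4. Drop some columns; drop() returns a new frame and leaves df unchanged.
trimmed = df.drop(columns=["c"])

# 2. DataFrame to NumPy: mixed int/float columns are upcast to float64.
array = trimmed.to_numpy()

# 3. NumPy to DataFrame: column names are lost in the array, so pass them back.
restored = pd.DataFrame(array, columns=trimmed.columns)

print(array.shape)  # (3, 2)
print(list(restored.columns))  # ['a', 'b']
```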