Pandas
Applications in pandas
“pandas is kind of excel in python”
- Dataframe
1
2
3
4
5# 创建一个示例 DataFrame
df = pd.DataFrame({
'A': [1, 2, 3],
'B': [4, 5, 6]
}) - Dataframe to numpy:
df.to_numpy()
- Numpy to Dataframe:
df = pd.Dataframe(array)
- drop some columns:
1
2
3# drop by column name list
column_list = ['1', '2', '3']
df = df.drop(column_list, axis=1) # drop some columns - rename column:
1
2
3
4
5# rename by column name:
df.rename(columns={"A":"a", "B":"b"})
# rename by index
df.rename(index={0: "x", 1: "y"}) - get column name list:
1
column_list = df.columns # return "Index['column1', 'column2'...'column10']"
- get continuous part of dataframe
1
2
3
4
5df_part1 = df.iloc[:, :6] # first 5 columns of df
df_part2 = df.iloc[:5, :] # first 5 rows of df
df_index = df.iloc[1, 2] # element at index (1, 2) - modify a column
1
2
3df['column_name'] = df['column_name'].map({'ClassA': 1, 'ClassB': 2}) # mapping Label to Number
df['column_name'] = df['column_name'].map(lambda x: x * 2) # mapping element using lambda - form a dataframe by multiple arrays
1
2
3
4df = pd.DataFrame({
'Column1': array1,
'Column2': array2
})