I have a pandas dataframe with few columns.
Now I know that certain rows are outliers based on a certain column value.
For instance
column Vol
has all values around 12xx and one value is 4000 (outlier).
I would like to exclude those rows that have Vol
column like this.
So, essentially I need to put a filter on the data frame such that we select all rows where the values of a certain column are within, say, 3 standard deviations from mean.
What is an elegant way to achieve this?