2 files changed, +23 -0 lines changed
@@ -10,4 +10,8 @@ Polars is an alternative to pandas that is designed to have better performance.
 directory with the same name.
 1. `polars_versus_pandas_benchmarks.ipynb`: Jupyter notebook that compares the
    performance of polars and pandas on a variety of operations.
+1. `create_csv_data.py`: Python script to generate one or more large CSV files
+   for benchmarking.
+1. `create_csv_data.slurm`: Slurm script to run `create_csv_data.py` on a
+   cluster.
 1. `data`: Directory containing the data used in the notebook.
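
`create_csv_data.py` itself is not among the files changed in this commit. The following is a minimal sketch of what such a generator might look like, assuming the command-line options used by the accompanying Slurm script (`--files`, `--cols`, `--rows`, and a positional base name). The function names, the `col_<i>` header, the `<base>_<i>.csv` file naming, the default values, and the choice of uniform random floats are all assumptions for illustration, not the author's implementation.

```python
import argparse
import csv
import random
from pathlib import Path


def write_csv(path, n_rows, n_cols, seed=None):
    """Write one CSV file: a header row followed by n_rows rows of random floats."""
    rng = random.Random(seed)
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        # Hypothetical column naming: col_0, col_1, ...
        writer.writerow([f"col_{i}" for i in range(n_cols)])
        # Rows are streamed one at a time, so memory use stays small
        # even when the resulting file is several GB.
        for _ in range(n_rows):
            writer.writerow([rng.random() for _ in range(n_cols)])


def parse_args(argv=None):
    parser = argparse.ArgumentParser(description="Generate large CSV files")
    parser.add_argument("--files", type=int, default=1, help="number of files")
    parser.add_argument("--cols", type=int, default=10, help="columns per file")
    parser.add_argument("--rows", type=int, default=1000, help="rows per file")
    parser.add_argument("base_name", help="base name for the output files")
    return parser.parse_args(argv)


def main(argv=None):
    args = parse_args(argv)
    paths = []
    for i in range(args.files):
        # Hypothetical naming scheme: <base_name>_<index>.csv
        path = Path(f"{args.base_name}_{i}.csv")
        write_csv(path, args.rows, args.cols, seed=i)
        paths.append(path)
    return paths
```

Calling `main(["--files", "1", "--cols", "3", "--rows", "5", "demo"])` would write a small `demo_0.csv`; a real script would typically call `main()` under the usual `if __name__ == "__main__":` guard.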
+#!/usr/bin/env -S bash -l
+#SBATCH --account=lpt2_sysadmin
+#SBATCH --nodes=1
+#SBATCH --ntasks=1
+#SBATCH --cpus-per-task=1
+#SBATCH --mem=2G
+#SBATCH --time=01:00:00
+#SBATCH --mail-user=geertjan.bex@uhasselt.be
+#SBATCH --mail-type=FAIL,END
+
+module purge
+module load Python/3.11.3-GCCcore-12.3.0
+
+# This should generate a file of approximately 6 GB
+python ./create_csv_data.py \
+    --files 1 \
+    --cols 100 \
+    --rows 2500000 \
+    large_data
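
As a sanity check on the "approximately 6 GB" comment: 100 columns times 2,500,000 rows is 250 million values, so 6 GB corresponds to roughly 24 bytes per value, separator included. The sketch below only does that arithmetic. The 24-byte figure is derived from the comment rather than measured, and `estimate_csv_bytes` is a hypothetical helper, not part of the repository.

```python
def estimate_csv_bytes(n_rows, n_cols, bytes_per_value):
    """Rough CSV size estimate.

    bytes_per_value is the average width of one field, including its
    trailing comma or newline. The header row is ignored.
    """
    return n_rows * n_cols * bytes_per_value


n_rows, n_cols = 2_500_000, 100
n_values = n_rows * n_cols                  # 250_000_000 values

# Bytes per value implied by a 6 GB target (1 GB taken as 10**9 bytes).
implied_width = 6 * 10**9 / n_values
print(implied_width)                        # 24.0

size_gb = estimate_csv_bytes(n_rows, n_cols, 24) / 10**9
print(size_gb)                              # 6.0
```

The actual size depends on how the script formats each value, so this is only an order-of-magnitude check.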