can MATLA read very large CSVs (>150GB)?

1 view (last 30 days)
sani on 23 Nov 2022
Commented: Cris LaPierre on 23 Nov 2022
Hi all,
I need to read and analyze six very large CSV files. In each file, I'm interested in only two columns. I already used opt to try reducing the amount of data, but my PC is constantly collapsing (since it reached the max memory of 128GB and the allocated memory as well).
My question is, do you know another approach to handle such large CSV files?
Eventually, I want to split these files, but I need to read them first.
Is that right?
Thanks :)

Answers (2)

Steven Lord
Steven Lord on 23 Nov 2022
Use the tools in MATLAB for processing Large Files and Big Data.

Cris LaPierre
Cris LaPierre on 23 Nov 2022
This sounds like what is called Big Data to me. MATLAB does have functionality for working with this data, A good starting place may be the Big Data with MATLAB page. You may be particularly intersted in the functionality for Tall Arrays.
Here's a good example you may be able to follow to get started.
Cris LaPierre
Cris LaPierre on 23 Nov 2022
MATLAB Tables were designed to work with Tall Arrays. See the 'Extended Capabilities' section of the coresponding doc pages

Sign in to comment.


Find more on Large Files and Big Data in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!