|
On 17 Sep, 02:01, Bryan Heit <bryans.spam.t...@gmail.com> wrote:
> I am having trouble reading in a text file. What I want is to
> generate an array of strins, 1 column wide by as many rows long as
> there is lines in the dataset. The dataset is an HTML page saved as
> text, containing bioinformatic information. I'm working on a script
> that'll pull specific species data out of the dataset, but cannot make
> much progress. I've tried several ways of reading the data
> (importdata, textscan, etc) to no avail. At best the first 4-5 lines
> get read in, then the read process is terminated (there are thousands
> of lines). The data itself looks as follows:
...
> Any help would be greatly appreciated.
You will have to write your own parser from scratch.
You should take some time to find out exactly what you
want to use these data for, and how, and come up with a
data structure that fits this use.
Once that's done, scan the file to extract (possibly
multi line) data items. Then scan the lines and extract
whatever data you want. Store the data in structures
or cell arrays.
My point is that this is a somewhat involved task that
might not be easily solved with canned routines. If you
think the above sounds daunting, find/hire somebody that
can help you - it is a standard programming task that any
computer science student can help with. Expect to spend
a bit of time explaining a helper how to separate the
data, though.
Rune
|