Program to calculate amino acid composition for a protein sequence in matlab

1 view (last 30 days)
For given problem it gives fasta file like sequences :>HMPREF9352_0001 rod shape-determining protein MreC [Streptococcus gallolyticus subsp. gallolyticus TX20005] MSLAFLFRNSGVVSAISSPIRSVVARVDSVVSAPFRFLDSANEEIRDLFNTYSENKELKQ KVAELEDQSELIDSLKEENEELNSEIGASSSITSQFSATGKVIVRSPVSWYDSLTVKLGK KNNITKKMLALSGGGLIGTVSDVDSTTSSITLLSNGSDFNIPIKITTSSAEVYGLLESYD SDKKCFVITNLNSSVDIEEGDSVVTSGLDGDTVANISVGTVSSVKNSSESLERVVYVTST ADFSDISYVTIVGD >HMPREF9352_0002 rod shape-determining protein MreD [Streptococcus gallolyticus subsp. gallolyticus TX20005] MIKVKFYKNKYFLLLLLFLLMLIDGQLSFLASSIFSYHLKVSSHLLLLAVLYFYHDKNKY FMFISSLVLGGIFDIYYLNRIGLVIFLLPILVIFTSKISKNFFVSNFQTLIFYIIVLFLF EIVGELGAILLGMTTMSMTYFIAYCFAPTLIYNILMYLIFQKVFKKVFLES >HMPREF9352_0003 CHAP domain protein [Streptococcus gallolyticus subsp. gallolyticus TX20005] MKKRILSAVLVSGVTLGTAAATVNADDYDTQIAAQDAVISNLTSEQAAAQSQVDALQEQV TSLQSQQDELEAQNAQLEAESQKLSEEIQALSSKIVARNESLKKQARSAQKTNTATSYIN TILNSKSISDAINRVAAVREVVSANEKMLEQQEADKAAIEQKQAENQEAINTVAANKATI EQNQAALATQQAELEAAQLNLSAQLATAEDEKASLVAQKEAAEQAAAEAAAAQAAAEAQA QAEAEAQAASVAQAQESVENGTATVDTTTDTSSQDSTTASTDTAAATEDTSSTQQAATVT PTATTTTSSSSSSSSASSSSSSSSSASTSSTASTSTSSSSSSSSSSSSVNTYPVGQCTWG VKSLASWVGNNWGNANQWIASAQAAGHSVGTTPQVGAVAVWPYDGGGYGHVAYVTAVQSS TSIQVMEANYAGNSSIGNYRGWFDPTSSTWGGGTVYYIYQ >HMPREF9352_0004 ribose-phosphate diphosphokinase [Streptococcus gallolyticus subsp. gallolyticus TX20005] MSYSDLKLFALSSNKELAEKVASAMGIELGKSTVRQFSDGEIQVNIEESIRGHHVFILQS TSSPVNDNLMEILIMVDALKRASAEKISVVIPYYGYARQDRKARSREPITSKLVANMLEV AGVDRLLTVDLHAAQIQGFFDIPVDHLMGAPLIADYFDRHGLVGDDVVVVSPDHGGVTRA RKLAQFLQTPIAIIDKRRSVTKMNTSEVMNIIGNVKGKKCILIDDMIDTAGTICHAADAL AEAGATAVYASCTHPVLSGPALENIEKSAIQKLVVLDTIYLSEERLIDKIEQISIAELIA EAITRIHEKRPLSPLFEMGTAK >HMPREF9352_0005 putative aromatic-amino-acid transaminase [Streptococcus gallolyticus subsp. gallolyticus TX20005] MSLTNRFNKNLDKIEVSLIRQFDQSISDVPGIMKLTLGEPDFTTPDHVKEAAKAAIDANQ SHYTGMAGLPALRQAAADFVKSKYNLSYNPDNEILVTIGATEALSATLTAILEPGDTVLL PAPAYPGYEPIANLVGAEIVEIDTTANDFVLTPEMLEKAILEQGDKLKAVLLNYPTNPTG VTYSREQIKALADVLKKYDIFVISDEVYSELTYNDEPHVSIAEYLPEQTILINGLSKSHA MTGWRIGLIFAPAIFTAQLIKSHQYLVTAAATMAQFAAIEALSAGKDDALPMKVEYIKRR DYIIDKMSALGFKIIKPDGAFYIFAKIPAGYEQDSFKFCQDFAREKAVAFIPGVAFGKYG EGYLRLSYAASMETITTAMERLKEFMEEHAN >HMPREF9352_0006 DNA repair protein RecO [Streptococcus gallolyticus subsp. gallolyticus TX20005] MQTKETYGLVLYNRNYREDDKLVKIFTETNGKHMFFVKHAGKSRFNSVIQPLTVAKFILK INDTGLSFIEDYKEVDSFKEINADLFKLSYASYVTALADAAVPDGVADPQLFAFVNKTLS LMEEGLDYEILTNIFEIQLLERFGVSLNFHECAFCHRVGLPFDFSHKYSGLLCPEHYGKD DYRSHLDPNVLYLVDRFQAIHFDELKTISVKPEMKRKLRLFIDDIYDNYVGLRLKSKKFI DDLGTWGNIMK
  1 Comment
Luuk van Oosten
Luuk van Oosten on 28 Jul 2015
You can use
fastaread
to import these kind of files.
But what is your question exactly? because you already have the amino acid composition of all your proteins; it is in the file. Or do you want a list per protein where it says how many L or W or A residues it has?

Sign in to comment.

Answers (0)

Products

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!