A Quick Example
In order to quickly get started with the ACPred-BMF program, I will demonstrate the entire process using Main_test.fasta which can be downloaded from Data-repository as input.
First, put the ‘Main_test.fasta’ into data/
cd ACPred-BMF
cp ../Main_test.fasta data/
Then, you only need to run the following order:
bash main.sh main
Then, you will get the results in the folder: /result/Main_test_main_prediction.csv
index,sequence,prediction
0,FLWWLFKWAWK,1
1,FAKLAKKALAKLL,1
2,GLFDIVKKIAGHIAGSI,1
3,VNFKKLLGKLLKVVK,1
4,WKKIPKFLHLLKKF,1
5,EQCGRQAGGKLCPNNLCCSQYGWCGSSDDYCSPSKNCQSNCKGGG,1
6,EADEPLWLYKGDNIERAPTTADHPILPSIIDDVKLDPNRRYA,1
7,FVGLAKVAAHVVPAIAEHF,1
8,FAKLLAKLAKKFAL,1
9,ARSYGNGVYCNNKKCWVNRGEATQSIIGGMISGWASGLAGM,1
10,KWKLFKKIGIGAFLHSAKKF,1
11,GIGKFLHSAKKWGKAFVGQIMNC,1
12,PAWRKAFRWAWRMLKKAA,1
13,FAKLLAKALKKLL,1
14,AIGKFLHSAKKFGKAFVGEIMNS,1
15,KRFKQDGGWSHWSPWSSC,1
16,FLPIALKALGSIFPKIL,1
17,TESYFVFSVGM,1
18,CLGIGSCNDFAGCGYAVVCFW,1
19,GMWSKILGHLIK,1
20,GFGALFKFLAKKVAKTVAKQAAKQGAKYVVNKQME,1
21,GMWSKILGHLKR,1
22,FKCRRWQWRMKKLGAPSITCVR,1
23,KWKLFKKIGIGAFLHLAKKF,1
24,CGESCVWIPCISAAIGCSCKNKVCYRAIP,1
25,KWKSFAKTFKSAKKTVAHTALKAISS,1
26,KSCCPNTTGRNIYNTCRFGGGSREVCARISGCKIISASTCPSDYPK,1
27,GKFMSLLKHILK,1
28,FAKKLKKLAKKL,1
29,FAKLLAKLAKKSL,1
30,FLPIIAKVLSGLL,1
31,ITCPQVTQSLAPCVPYLISG,1
32,FAKLLAKLAKKIL,1
33,FAKKLAKKLAKAAL,1
34,FAKLLAKLAKKAA,1
35,FLGWLFKWASK,1
36,GIGAVLKVLTTGLPALISWIKRKRQQ,0
37,FNRGGYNFGKSVRHVVDAIGSVAGILKSIR,0
38,GVPCGESCVFIPCITGVIGCSCSSNVCYLN,1
39,FFPVIGRILNGIL,1
40,KMWSKILGHLIR,1
41,FALALKALKKLKKALKKAL,1
42,FLGALFKALSKLL,1
43,FFPNVASVPGQVLLKKIFCAISKKC,1
44,GEYCGESCYLIPCFTPGCYCVSRQCVNKN,1
45,FLSLIPHIVSGVAALAKHL,1
46,SQETFSDLWKLLPEN,0
47,WHSDMEWWYLLG,1
48,FAEPLPSEEEGESYSKEPPEMEKRYGGFM,0
49,PAWRKAFRWAARMLKKAA,1
50,VNWKKILGKIIKVAK,1
51,GLFDIVKKIAGHIASSI,1
52,WFKKIPKFLHLLKKF,1
53,GIIKKIIKKIIKKI,1
54,KWKSFLKTFKSLKKTVLHTALKAISS,1
55,FAKKLAKALL,1
56,FLGWLFKWAKK,1
57,ACYCRIPACLAGERRYGTCFYRRRVWAFCC,1
58,FALAKALKKAL,1
59,GKWKKILGKLIR,1
60,LALMLPGC,1
61,FALALKALKKA,1
62,FLKLLKKLAAKLF,1
63,DILTFEHYWAQLTS,1
64,VAKFLAKFLKKAL,1
65,FRGLAKLLKIGLKSFARVLKKVLPKAAKAGKALAKSMADENAIRQQNQ,1
66,VNWKKVLGKVVKVVK,1
67,MWKWFHNVLSWWWLLADKRPARDYNRK,1
68,GSEGPLKPGARIFSFDGKDVLRHPT,1
69,LLGMIPLAISAISALSKL,1
70,FLPVIAGVAAKFLPKIFCAITKKC,1
71,ILGPVISTIGGVLGGLLKNL,0
72,FWGALAKGALKLIPSLFSSFSKKD,1
73,GIGVLLSAGKAALKGLAKVLAEKYAN,1
74,LGFWGLPH,1
75,RRRRRRRRGNLWAAQRYGRELRRMSDEFVDSFKK,0
76,LVRGCWTKSYPPKPCFVR,0
77,GLFKVIKKVASVIGGL,1
78,FLSHIAGFLSNLF,1
79,FLGAIAAALPHVINAVTNAL,1
80,FALALKALKKAL,1
81,FAKKLAKKAKLAKKL,1
82,CSCRTSSCRFGERLSGACRLNGRIYRLCC,1
83,FAKKLAKKLL,1
84,GLLGLLGSVVSHVVPAIVGHF,1
85,FVDLKKIANIINSIFKK,1
86,FLIGMTHGLICLISRKC,0
87,KAAKKAAKAAKKAAKAAKKAA,1
88,GLPVCGETCFGGTCNTPGCACDPWPVCTRD,1
89,GLFDIVKKVVGTLAGL,1
90,GRRKRKWLRRIGKGVKIIGGAALDHL,0
91,KLAKLAKKLAKLAK,1
92,PNEVNRLAHLRLH,1
93,GLFGVLGSIAKHVLPHVVPVIAEK,1
94,GLPICGETCVGGSCNTPGCSCSWPVCTRN,1
95,PDEDAINNALNKVCSTGRRQRSICKQLLKK,1
96,GLFKVLGSVAKHLLPHVVPVIAEK,1
97,GLFDIIKKVASVIGGL,1
98,KAAKKWAKAAKKAAKAWKKAA,1
99,GSLCGDTCFVLGCNDSSCSCNYPICVKD,1
100,KWKLF,1
101,YKQCHKKGGHCFPKEKICLPPSSDFGKMDCRWRWKCCKKGSG,0
102,GLPTCGETCFGGTCNTPGCTCDPWPVCTHN,1
103,AIPCGESCVWIPCISTVIGCSCSNKVCYR,1
104,FLSLALAALPKFLCLVFKKC,1
105,PAWRKARRWAWRMKKLAA,1
106,FLPGLIAGIAKML,1
107,KAKLAKKALAKLL,1
108,GGLRSLGRKILRAWKKYGPIIVPIIRIG,1
109,PRFWEYWLALME,1
110,FLPVIAGLLSKLF,1
111,GLFDVIKAVASVIGGL,1
112,FFGSVLKVAAKVLPAALCQIFKKC,1
113,FAKLLAKLAKKAL,1
114,FLKLLKKLAAKFLPTIICKISYKC,1
115,YCAYYSPRHKTTF,1
116,HHPHGHHPHGHHPHGHHPHG,1
117,GACFSIAHECGA,1
118,VRRFPWWWPFLRR,1
119,FLPILINLIHKGLL,1
120,FLSIIAKVLGSLF,1
121,FFSASCVPGADKGQFPNLCRLCAGTGENKCA,0
122,FIHHIIGGLFSAGKAIHRLIRRRRR,1
123,PPKSQ,1
124,KKKFPWWWPFKKKCKKKFPWWWPFKKKC,1
125,VAKKFAKKFKKFAKKFAKFAFAF,1
126,GLLSVLGSVVKHVIPHVVPVIAEHL,1
127,FGKGIGKVGKKLL,1
128,AWKLFDDGV,1
129,PEWFKCRRWQWRMKKLGA,1
130,YERDPRQQYEQCQRRCESEATEEREQEQCEQRCEREYKEQQRQQEEE,0
131,FALALKALKK,1
132,GLFDVIAKVASVIKKL,1
133,CAHNLTHAC,1
134,GIGKFLKKAKKGIGAVLKVLTTGL,1
135,ATCKAECPTWDSVCINKKPCVACCKKAKFSDGHCSKILRRCLCTKEC,1
136,PAWRKAARWAWRMLKKAA,1
137,GIGTKILGGVKTALKGALKELASTYAN,1
138,CIKNGNGCQPNGSQNGCCSGYCHKQPGWVAGYCRRK,0
139,FALALKALKKALKKLKKALKKAL,1
140,FLSLLPSIVSGAVSLAKKL,1
141,ACYCRIGACVSGERLTGACGLNGRIYRLCCR,1
142,ATCDLLSMWNVNHSACAAHCLLLGKSGGRCNDDAVCVCRK,0
143,FLPVIAGVAANFLPKLFCAISKKC,1
144,GFLDTFKNLALNAAKSAGVSVLNSLSCKLFKTC,0
145,QSHLSLCRWCCNCCRSNKGC,0
146,ALWKNMLKGIGKLAGKAALGAVKKLVGAES,1
147,KWKLFKKIGIGKFLHLAKKF,1
148,FDIVKKIAGHIAGSI,1
149,CSTNTFSLSDYWGNKGNWCTATHECMSWCK,1
150,KSCCKNTTGRNIYNTCRFAGGSRERCAKLSGCKIISASTCPSYPDK,1
151,FFHHIFRGIVHVGKTIHRLVTG,1
152,AISYGNGVYCNKEKCWVNKAENKQAITGIVIGGWASSLAGMGH,1
153,ETCASRCPRPCNAGLCCSIYGYCGSGAAYCGAGNCRCQCRG,1
154,GFRDVLKGAAKAFVKTVAGHIANI,0
155,LGGIVSAVKKIVDFLG,1
156,GIIKKIIIKKIIIKKIIIKKI,1
157,GIGKFLHSAKKFAKAFVAEIMNS,1
158,FALA,1
159,KNWKKILKKIIKVVK,1
160,KILRGVCKKIMRTFLRRISKDILTGKK,1
161,SPLGYGFAVRNSG,0
162,GLFGKLIKKFARKAISYAVKKARGKH,1
163,ANTAFVSSAHNTQKIPAGAPFNRNLRAMLADLRQNAAFAG,0
164,PDEDAINDALNKVCSTGRRQRSICKQLLKK,1
165,FAFGKGIGKVGKKLL,1
166,FAKKLLAKALKL,1
167,CHANLTHAC,1
168,FKSWSFCTPGCAKTGSFNSYCC,0
169,FKRLAKIKVLRLAKIKR,1
170,FLPLILRKIVTAL,1
171,ATCDLLSAFGVGHAACAAHCIGHGYRGGYCNSKAVCTCRR,1
172,LLKELWTKIKGAGKAVLGKIKGLL,0
173,DWTFANWSCLVCDDCSVNLTV,0
174,GIPCGESCVWIPCISAALGCSCKNKVCYRN,1
175,GLIGSIGKALGGLLVDVLKPKL,0
176,VLPLISMALGKLL,0
177,FISAIASMLGKFL,1
178,SAISCGETCFKFKCYTPRCSCSYPVCK,0
179,INWKKIASIGKEVLK,0
180,DSHAKRHHGYKRKFHEKHHSHRGY,0
181,IFGSLFSLGSKLLPTVFKLFSRKKQ,0
182,MNNTIKDFDLDLKTNKKDTATPYVGSRYLCTPGSCWKLVCFTTTVK,0
183,CLGVGSCNDFAGCGYAIVCFW,1
184,GVLGTVKDLLIGAGKSAAQSTLKTLSCKISNDC,0
185,EKYTEVPEYI,0
186,VKCAVKDTYSCFIVRGKCRHECHDFEKPIGFCTKLNANCYM,0
187,APKGVQGPNG,1
188,PAQPFRIKKRQGPFERP,0
189,GTVPCGESCVFIPCITGIAGCSCKNKVCYIN,0
190,RCVCTRGFCRCVCTRGFC,0
191,AQCGAQGGGATCPGGLCCSQWGWCGSTPKYCGAGCQSNCR,1
192,GLMSTLKDFGKTAAKEIAQSLLSTASCKLAKTC,0
193,GLWNSIKIAGKKLFVNVLDKIRCKVAGGC,0
194,KTCENLSGTFKGPCIPDGNCNKHCRNNEHLLSGRCRDDFRCWCTNRC,0
195,GLFTKFAGKGIKDLIFKGVKHIGKEVGMDVIRVGIDVAGCKIKGVC,0
196,GFLGPLLKLGLKGAAKLLPQLLPSRQQ,1
197,KNLRRIIRKGIHIIKKYG,0
198,ATCDLLSGFGVGDSACAAHCIARGNRGGYCNSKKVCVCRN,1
199,RWKFFKKIEKVGQNIRDGIIKAGPAVAVVGQAASIT,1
200,SLNVMRKGIRKQPVSSGKRGGVNDYDM,0
201,RLCPRVRIRVCR,0
202,SVFAFENEQSSTIAPARLYK,0
203,LLKKLLKWLKK,0
204,AVNIPFKVHFRCKAAFC,0
205,GRSKKLGKKIEKAGKRVFNAAQKGLPVAAGVQAL,0
206,SIGFDGLNDPDIVAR,0
207,CAETCVVLPCFIVPGCSCKSSVCYFN,1
208,GVGDIFRKIVSTIKNVV,0
209,GIPCAESCVWIPCTVTALLGCSCSNNVCYN,1
210,GLLDSLKNLAINAAKGAGQSVLNTLSCKLSKTC,0
211,CLGIGSCNNFAGCGYAVVCFW,1
212,GLLDTLKNMAINAAKGAGQSVLNTLSCKLSKTC,0
213,ILSAIWSGIKSLF,0
214,YAFGYPS,1
215,VTCFCRRRGCASRERLIGYCRFGNTIYGLCCRR,0
216,RRCICTTRTCRFPYRRLGTCLFQNRVYTFCC,0
217,LRDLVCYCRTRGCKRREHMNGTCRKGHLMYTLCCR,0
218,KCWNLRGSCREKCIKNEKLYIFCTSGKLCCLKPKFQPNMLQR,0
219,HFLGTLVNLAKKIL,1
220,DKLIGSCVWGAVNYTSNCNAECKRRGYKGGHCGSFANVNCWCER,0
221,WNPFKELERAGQRVRDAVISAAAVATVGQAAAIARGG,0
222,GKNGVFKTISHECHLNTWAFLATCCS,1
223,YCNRRTGKCQRM,1
224,LE,1
225,GLFPKFNKKKVKTGIFDIIKTVGKEAGMDVLRTGIDVIGCKIKGEC,0
226,VFIDILDKMENAIHKAAQAGIGIAKPIEKMILPK,0
227,GLWDSIKNFGKTIALNVMDKIKCKIGGGCPP,0
228,FLPILASLAAKLGPKLFCLVTKKC,1
229,FFPIVGKLLFGLSGLL,0
230,RDWERREFERRQNELRREQEQRREELL,0
231,GLLDTLKNMAINAAKDAGVSVLNTLSCKLSKTC,0
232,HRHQGPIFDTRPSPFNPNQPRPGPIY,1
233,MWGRILAFVAKYGTKAVQWAWKNKWFLLSLGEAVFDYIRSIWGG,0
234,ILGPVLGLVGNALGGLIKKI,0
235,GCLEFWWKCNPNDDKCCRPKLKCSKLFKLCNFSF,0
236,QLGDVLQKAGEKIVRGLKNIGQRIKDFFGKLTPRTES,0
237,ASGWVCTLTIECGTLVCAC,0
238,LIKIVPAMICAVTKKC,0
239,LLEL,0
240,GIPCGESCVFIPCITSVAGCSCKSKVCYRN,1
241,LLKWLKKWLKK,0
242,KYYGNGVHCGKKTCYVDWGQATASIGKIIVNGWTQHGPWAHR,1
243,VFQFLGRIIHHVGNFVHGFSHVF,0
244,RRSRKNGIGYAIGYAFGAVERAVLGGSRDYNK,0
245,GLLDAIKDTAQNLFANVLDKIKCKFTKC,0
246,GLGSFLKNAIKIAGKVGSTIGKVADAIGNKE,1
247,KTCEHLADTYRGVCFTNASCDDHCKNKAHLISGTCHNWKCFCTQNC,0
248,FLGGLIKIVPAMICAVTKK,1
249,VCSCRLVFCRRTELRVGNCLIGGVSFTYCCTRVD,1
250,GTACGESCYVLPCFTVGCTCTSSQCFKN,0
251,GKKLFVNVLDKIRCKVAGGC,0
252,GIFSLIKGAAKLITKTVAKEAGKTGLELMACKVTNQC,0
253,VIPFVASVAAEMMPHVYCAASRKC,0
254,RYCERSSGTWSGVCGNTDKCSSQCQRLEGAAHGSCNYVFPAHKCICYYPC,0
255,GLLLDTLKGAAKDIAGIALEKLKCKITGCKP,0
256,ILKKWPWWPWRRK,1
257,RIITCSCRTFCFLGERISGRCYQSVFIYRLCCRG,0
258,INWKKIASIGKEVL,0
259,GLLSRLRDFLSDRGRRLGEKIERIGQKIKDLSEFFQS,0
260,KSYGNGVQCNKKKCWVDWGSAISTIGNNSAANWATGGAAGWKS,0
261,GLFGRLRDSLQRGGQKILEKAERIWCKIKDIFRG,0
262,KWKLFKKIGIGAVLKVLT,0
263,LDEPNMDTISKSREYKCKIDLDCSNHIACRHCSYRNCKCDHGTCKCMP,0
264,KTCENLADTFRGPCFATSNCDDHCKNKEHLLSGRCRDDFRCWCTRNC,0
265,ADDKNPLEEAFREADYEVFLEIAKNGL,0
266,QFTNVSCTTSKECWSVCQRLHNTSRGKCMNKKCRCYS,0
267,GVLDAFRKIATVVKNVV,0
268,FDIIKKVASVVG,1
269,KWKWKW,0
270,RFIPPILRPPVRPPFRPPFRPPFRPPPIIRFFGG,0
271,KQQLATEAESAGPIL,0
272,EFKRCWKGQGACRTYCTRQETYMHLCPDASLCCLSYALKPPPVPKHEYE,0
273,KGRGKQGGKVRAKAKTRSS,0
274,FFPIVGKLLSGLF,1
275,DKLIGSCVWGAVNYTSNCRAECKRRGYKGGHCGSFLNVNCWCET,0
276,EQCGRQAGGATCPNNLCCSQYGY,1
277,KWKLFKKIEKVGQGIGAVLKVLTTGL,1
278,KIKWFKTMKSLAKFLAKEQMKKHLGE,0
279,ATCRKPSMYFSGACFSDTNCQKACNREDWPNGKCLVGFKCECQRPC,0
280,GILDIAKKLVGGIRNVLGI,0
281,WYVKKCLNDVGICKKKCKPEEMHVKNGWAMCGKGRDCCVPAD,0
282,LLPILGNLLNGLL,0
283,AGIGKIGDFIKKAIAKYKN,1
284,GLFSKFSGKGIKNFLIKGVKHIGKEVGMDVIRTGIDVAGCKIKGEC,0
285,RFPWWWPFLR,1
286,SYVGDCGSNGGSCVSSYCPYGNRLNYFCPLGRTCCRHAYV,0
287,GLFSKFAGKGIKNLIFKGVKHIGKEVGMDVIRTGIDVAGCKIKGEC,0
288,RCYTNDDCKDGQPCPVPLACLFGSCICPWKSQSKLPICQIICANLD,0
289,ENCGRQAG,0
290,LDSLSFSYNNFEEDD,0
291,AVKDTYSCFIMRGKCRHECHDFEKPIGFCTKLNANCYM,0
292,ANLDAIIKIQAWARMWAARRQYL,0
293,KFFRKLKKSVKKRAKEFFKKPRVIGVSIPF,0
294,GSVFNCGETCVLGTCYTPGCTCNTYRVCTKD,0
295,KLLKKLLKWLK,1
296,FLPIIAGMAAKVICAITKKC,1
297,GGLRSLGRKILRAWKKYG,1
298,SLGSFMKGVGKGLATVGKIVADQFGKLLEA,0
299,INWLKLGKKMMSAI,0
300,MKVFFLFAVLFCLVRRNSVHISHQEARGP,1
301,KAGLAFPVGRVHRLLRK,1
302,FTIAEPYIHPCMKGFCSFKSECANKCIFMGHHKGGDCIGGLDGIYCCCLA,0
303,WFRKQLKW,1
304,LLSLVPHAINAVSAIAKHF,0
305,YSRCQLQGFNCVVRSYGLPTIPCCRGLTCRSYFPGSTYGRCQRF,0
306,FLPLLLSALPSFLCLVFKKC,1
307,GVLDAFRKIATVVKNLV,0
308,IFGSLFSLGSKLLPSVFKLFSRKKQ,0
309,GRFRRLRKKTRKRLKKIGKVLKWIPPIVGSIPLGC,0
310,RRWFWR,0
311,GLPCGESCVFIPCITTVVGCSCKNKVCYND,1
312,DYDWSLRGPPKCATYGQKCRTWSPPNCCWNLRCKAFRCRPR,1
313,GVFTLIKGATQLIGKTLGKEVGKTGLELMACKITKQC,0
314,EKCLRWQWRMRKYGG,0
315,ACYCRIPACLAGERRYGTCFYLGRVWAFCC,1
316,QCMQLETSGQMRRCVSQCDKRFEEDIDWSKYDNQE,0
317,GPIQISYNYNYGPCGRYCGILGVSPGDNLDCGNQR,1
318,GLLSGVLGVGKKIVCGLSGLC,0
319,SPKKTKPVKPKKVA,0
320,GLWSKIKEAAKTAGLMAMGFVNDMV,0
321,SGKRWWRRKK,0
322,QQCGRQASGRLCGNRLCCSQWGYCGSTASYCGAGCQSQCRS,0
323,GNPKVAHCASQIGRSTAWGAVSGA,1
324,RWKRWWRRKK,0
325,SIGSALKKALPVAKKIGKIALPIAKAALP,0
326,SISCGETCTTFNCWIPNCKCNHHDKVCYWN,0
327,GGLKKLGKKLEGVGKRVFKASEKALPVLTGYKAIG,0
328,GYGCPFNQYQCHSHCSGIRGYKGGYCKGTFKQTCKCY,0
329,DKLIGSCVWGAVNYTRNCNAECKRRGYKGGHCGSFANVNCWCET,0
330,GLFLDTLKGLAGKLLQGLKCIKAGCKP,0
331,YSKSLPLSVLNP,1
332,GLPVCGETCTLGKCYTAGCSCSWPVCYRN,1
333,MSKLVQAISDAVQAQQNQDWAKLGTSIVGIVENGVGILGKLFGF,0
334,QVFTLIKGATQLIRKTLGEQ,0
335,SGKLWWRRKK,0
336,WKSESLCTPGCVTGALQTCFLQTLTCNCKISK,0
337,FLGALWNVAKSVF,0
338,SLSRFLSFLKIVYPPAF,0
339,FIMDLLGKIF,1
340,KIPCGESCVWIPCVTSIFNCKCKENKVCYHD,0
341,DKLIGSCVWLAVNYTSNCNAECKRRGYKGGHCGSFLNVNCWCET,0
342,FLPILGKLLSGLL,1
343,SVPTSVYTLGIKILWSAYKHRKTIEKSFNKGFYH,0