Leveraging off higher plant phylogenetic insights for antiplasmodial drug discovery

2023-11-13 10:53PhanankosioyoLukeInvernizziSephoraiandaWiehanRudolphWarrenAndayiingxunWangNeilCrouchandVineshaharaj
Natural Products and Bioprospecting 2023年5期

Phanankosi Μoyo, Luke Invernizzi, Sephora Μ.Μianda, Wiehan Rudolph, Warren A.Andayi,Μingxun Wang, Neil R.Crouchand Vinesh J.Μaharaj*

Abstract The antimalarial drug-resistance conundrum which threatens to reverse the great strides taken to curb the malaria scourge warrants an urgent need to find novel chemical scaffolds to serve as templates for the development of new antimalarial drugs.Plants represent a viable alternative source for the discovery of unique potential antiplasmodial chemical scaffolds.To expedite the discovery of new antiplasmodial compounds from plants, the aim of this study was to use phylogenetic analysis to identify higher plant orders and families that can be rationally prioritised for antimalarial drug discovery.We queried the PubΜed database for publications documenting antiplasmodial properties of natural compounds isolated from higher plants.Thereafter, we manually collated compounds reported along with plant species of origin and relevant pharmacological data.We systematically assigned antiplasmodial-associated plant species into recognised families and orders, and then computed the resistance index, selectivity index and physicochemical properties of the compounds from each taxonomic group.Correlating the generated phylogenetic trees and the biological data of each clade allowed for the identification of 3 ‘hot’ plant orders and families.The top 3 ranked plant orders were the (i) Caryophyllales, (ii) Buxales, and (iii) Chloranthales.The top 3 ranked plant families were the (i) Ancistrocladaceae, (ii) Simaroubaceae, and (iii) Buxaceae.The highly active natural compounds(IC50 ≤ 1 μΜ) isolated from these plant orders and families are structurally unique to the ‘legacy’ antimalarial drugs.Our study was able to identify the most prolific taxa at order and family rank that we propose be prioritised in the search for potent, safe and drug-like antimalarial molecules.

Keywords Natural products, Plants, Phylogenetics, Μalaria, Drug-resistance, ‘Hot’ plants

1 Introduction

Malaria is a vector-borne tropical disease caused by unicellular protozoan parasites of the genusPlasmodium[1].Despite conspicuous progress in controlling and managing malaria, the disease remains a serious public health challenge.Malaria is currently endemic in 84 countries,with the World Health Organisation (WHO) reporting the African Region as the most afflicted by this disease [2].In 2021 there was a total of 247 million clinical malaria cases and 619,000 malaria-induced fatalities globally, with the African Region accounting for 95% of the reported cases and fatalities.Current antimalarial drugs are becoming less effective due to the emergence and spread of drug-refractoryPlasmodiumparasite strains [3].These strains arise as a product of mutations,most notably due to either DNA replication errors or damage induced by reactive oxygen species [4, 5].These mutations give rise to phenotypes with changes in, for example, either drug targets or transporters which conferPlasmodiumparasites with resistance to antimalarial drugs.This resistance phenomenon has been observed for all previously used antimalarials [6], including the current WHO-recommended first-line treatment drugs for malaria, namely artemisinin-based combination therapy (ACT) [6].This highlights the need to discover and develop new and alternative malaria treatment regimens.

One promising strategy for discovering new curative antimalarial compounds is the identification of plants that produce compounds with antiplasmodial activity.Plants have evolved to produce a diverse array of chemical compounds to defend themselves against pathogens and parasites, many of which have potential medicinal properties.This has made them a reliable source for the discovery of privileged chemical scaffolds, which have served as a foundation for developing a plethora of pharmaceutical agents [7].Some of the linchpin malaria chemotherapeutics were likewise discovered from plants.From aCinchonaspecies (Rubiaceae), the alkaloid quinine was isolated.This compound served as a template from which derivatives, including chloroquine, were synthesised.Lapachol, a naphthoquinone first isolated in 1882 from the bark ofTabebuiaavellanedae(Bignoniaceae), served as a scaffold which inspired the development of the antimalarial drug atovaquone.Similarly,from the Chinese herbArtemisiaannua(Asteraceae),the sesquiterpene lactone artemisinin was isolated and semi-synthesised to form prolific fast-acting derivatives,namely artemether, dihydroartemisinin and artesunate which are the core constituents of the ACT regimen [8].Over the last century, these plant-derived antimalarials have saved millions of lives [9–11].In view of this, there is merit in the continued investigation of plants in search of novel antimalarial agents to redress the drug-resistance scourge.

Given the diversity and expansiveness of the plant kingdom (ca.370,000 flowering plant species [12]) and limited research resources, there is a need for a rational strategy to streamline and focus drug screening projects on selected plant species.Adoption of such strategies is envisaged to expedite antimalarial drug discovery by simplifying the plant screening process.Moreover,this is anticipated to come with the added advantage of an increased likelihood of discovering promising leads(hits) as research resources will be focused on the ‘hot’taxonomic groups (i.e., taxa with an overrepresentation of active compounds) [13, 14].One logical approach adopted in some drug discovery projects that prioritise plant subjects for evaluation, is phylogenetic analysis[15–17].Such phylogenetic analyses allow for identification of ‘hot’ plant genera, families or orders that have been demonstrated to produce bioactive compounds against specific therapeutic targets [18].This concept emanates from the premise that phylogeny and biosynthetic pathways are correlated; therefore, the production of specific bioactive natural products with peculiar biological properties will likely be common to closely related plant species at the level of genus, family, or families within an order [14].In line with this principle various members of the filamentous bacterial genusStreptomyceshave yielded an array of secondary metabolites (including tetracyclines, aminoglycosides and macrolides) with commercially useful antimicrobial activity [19, 20].Similarly, amongst higher plants, the Amaryllidaceae plant family is well established to exclusively produce specific alkaloids, including the lycorine-type alkaloids, which exhibit a distinct pharmacological profile [21].The correlation of phylogenetic analysis and pharmacological data allows us to effectively identify ‘hot’ plant taxonomic groups of specific interest to man based on what is experimentally known about compounds isolated from plant species in those plant orders and families.From these taxonomic groups, closely related, untapped species can be rationally prioritised for pharmacological evaluation[18].

Over recent decades, numerous natural compounds isolated from different plant species have been evaluated for their in vitro antiplasmodial activity.This study aimed to use phylogenetic analysis to identify ‘hot’ plant orders, and families that: (i) produce active antiplasmodial compounds (IC50≤ 10 μM) with (ii) acceptable resistance index (RI ≤ 10), (iii) a selectivity index (SI ≥ 10), and(iv) drug-like properties.We collated data on active and inactive antiplasmodial plant-derived compounds from literature published between 1964 and 2021.We determined, including through resolution of synonymy, the identity of plant species yielding antiplasmodial isolates and used the information to construct phylogenetic trees,which were correlated to quantified antiplasmodial and cytotoxicity data.From the generated trees, we were able to establish the distribution patterns of plant species in different ‘hot’ plant families and orders.We believe this analysis will facilitate the selection of taxa warranting further evaluation in antimalarial drug discovery programs, to optimize project outcomes.

2 Results, discussion and conclusion

2.1 Descriptive analysis: articles evaluated, plant taxonomy, and antiplasmodial screening

A PubMed database query using the key phrase “Plasmodiumfalciparumand natural product”, limiting the search to the abstract, yielded a total of 3863 articles (Fig.1).These articles were published within a 58-year period ranging from 1964 to 2021 (Fig.1).From this pool of articles, duplicates and publications reporting on non-higher plant-derived natural products were excluded.Furthermore, studies where natural compounds were neither isolated nor screened in vitro againstP.falciparum, were excluded.Following the application of these exclusion criteria, we filtered to a total subset of 455 articles.

Fig.1 Number of articles published per year (1964 to 2021)on “Plasmodium falciparum and natural product”.The annual number of publications on the topic of “Plasmodium falciparum and natural product” has evidently increased over the years within the last decade, being the most productive in that regard.These search results were realised following the query made on PubΜed and limited to abstracts

From the relevant 455 articles, 2426 natural compounds, reportedly either active or inactive in vitro againstP.falciparumparasites were manually compiled.These compounds were collected along with components of plant species from which they were initially isolated,and their respective antiplasmodial activities were determined.These molecules were isolated from 439 plant species belonging to 99 vascular plant families referred to 37 plant orders (Table 1).This total number of plant species only represents a small fraction (ca.0.1%) of all known higher plant species worldwide.Theca.2400 plant-derived compounds which our study was limited to is substantially less than that analysed in a similar study by Zhu et al.[15] whereca.31,000 compounds were considered.In their study, Zhu and colleagues examined the species-distribution of 939 clinically approved natural product-derived drugs, 369 clinical candidates, and 119 preclinical candidates.Furthermore, 13,548 marine derived natural products and 19,721 bioactive secondary metabolites were included in their study [15].The natural product derived drugs and bioactive compounds considered were from several sources including plants,microorganisms, and marine organisms [15].Furthermore, these were drugs and bioactive compounds for several therapeutic areas [15].Disease-area focused investigations have previously analysed the number of compounds consistent with our study, although not for antiplasmodial activity.For example, in their investigation to examine the phylogenetic distribution of anticancer drugs, Li et al.[22] analysed 207 natural product derived drugs.From a malaria drug discovery standpoint our study is, to the best of our knowledge, the first to extensively examine the relationship between phylogeny and antiplasmodial activity of natural product compounds.Earlier approaches have focused on compound classes in relation to antiplasmodial activity, exampled by the study of Egieyeh et al.[23] which used cheminformatics profiling to prioritise natural products for antimalarial drug discovery.In their study they managed to analyse 1040 natural product compounds isolated from plants,microorganisms, and marine sources [23].

Table 1 Taxonomic representation of plant species yielding compounds subsequently investigated for antiplasmodial activity,arranged by Order

Of the 2400 compounds analysed in our study, the Asterids and Rosid lineages were the most overrepresented clades with 6 and 10 plant orders, respectively (see Table 3).The plant order with the greatest number of different accepted plant species investigated was the Sapindales (n = 65), closely followed by Asterales (n = 48) and Gentianales (n = 55).The Chloranthales (4%), Cupressales (1.8%), Canellales (1.75%), Buxales (1.54%), and Sapindales (0.95%) were the most investigated relative to the total number of accepted plant species known to occur in those orders.In contrast, the Asparagales (0.02%), and Alismatales (0.02%) were evidently the orders least investigated, considering their species richness (Table 1).

Compounds isolated from plant species in the different taxonomic groups were primarily assessed for activity against the 3D7 (n = 426), D6 (n = 254), and NF54(n = 182) intra-erythrocytic asexualP.falciparumparasite drug-sensitive (D-S) strains.In vitro evaluation of potency against the intra-erythrocytic asexualP.falciparumparasite drug-resistant (D-R) parasites was predominantly carried out on the K1 (n = 689), W2 (n = 399) and Dd2 (n = 317) strains (Table 2).

Table 2 Top 5 D-S and D-R intra-erythrocytic asexual P. falciparum parasite strains most targeted for in vitro antiplasmodial screening of plant-derived compounds in reports considered

2.2 Phylogenetic analyses correlated to biological data

Following identification to species rank of taxa yielding compounds tested for antiplasmodial activity phylogenetic trees (cladograms) of higher plant orders and plant families were constructed using the NCBI Taxonomy database [24] and graphically displayed using an online tool, viz., the ‘interactive Tree of Life’ (iTOL) (Figs.2 and 3) [25].The constructed trees are consistent with the taxonomic classification and nomenclature reflected in‘The World Flora Online’ [26].The relationship betweenantiplasmodial activity and phylogenetic relationships at order and associated family levels was then determined and expressed as ‘’hit rates (HR) %’ for taxonomic groups with ≥ 10 compounds isolated from them.The HR were calculated by dividing the number of compounds with an IC50≤ 10 μM by the total number of compounds isolated and experimentally evaluated for activity against either the D-S or D-R plasmodia.Plant taxa were correlated to the calculated HR of their compounds.Given the extensive diversity of higher plants globally, we considerit prudent to strategically focus discovery phase research on either ‘hot’ plant orders, or ‘hot’ families which yield compounds with high HR.This is assumed to increase the likelihood of successfully identifying active antiplasmodial compounds within a short time frame.

Generally, compounds from most plant orders showed high HR of 35% and 43% against asexual D-S and D-RP.falciparumparasites, respectively (Fig.2).Compounds isolated from the Buxales had the highest HR (96%, n(number of compounds) = 25).This was closely followed by the Caryophyllales (HR = 79%, n = 53) and Laurales(HR = 67%, n = 12) (Fig.2).The lowest HR were noted for the Dioscoreales (HR = 0%, n = 14), Ericales (HR = 13%,n = 23), and Magnoliales (HR = 16%, n = 91).Against D-R strains, the Celastrales (HR = 92%, n = 12) and Buxales(HR = 91%, n = 11) were found to have the highest HR,whereas the Dioscoreales (HR = 0%, n = 14), and Brassicales (HR = 10%, n = 10) presented the lowest HR against thisPlasmodiumform.Noteworthy plant orders with markedly different HR in relation to D-S and D-R parasites were the Celastrales (51% difference), Brassicales(30% difference) and Piperales (22% difference).

Fig.2 Phylogenetic tree of plant orders investigated in vitro for their activity against intra-erythrocytic asexual P. falciparum parasites.The tree was generated using NCBI Taxonomy and processed using iTOL.ND not determined: This applies for plant orders with < 10 compounds isolated from them and subsequently evaluated for their antiplasmodial activity.The hit rate (HR) is the % of compounds with an IC50 ≤ 10 μΜ for each plant order.D-S—drug-sensitive.D-R—drug-resistant

Compounds isolated from the Simaroubaceae (in Sapindales) demonstrated the highest HR (100%, n = 19)against the D-S parasites (Fig.3).This plant family was closely followed by the Buxaceae (in Buxales) (HR = 96%,n = 25), Ancistrocladaceae (in Caryophyllales) (HR = 76%,n = 42), and Lauraceae (in Laurales) (HR = 70%, n = 10)(Fig.3).The lowest HR were noted for the Dioscoreaceae(in Dioscoreales) (HR = 0%, n = 14), Malvaceae (in Malvales) (HR = 0%, n = 10), and Bignoniaceae (in Lamiales)(HR = 7%, n = 14).Against D-R strains, the Ancistrocladaceae (HR = 96%, n = 45) had the highest HR, marginally more than that for the Celastraceae (in Celastrales)(HR = 92%, n = 12), Buxaceae (HR = 91%, n = 11) and Simaroubaceae (HR = 91%, n = 55).The Dioscoreaceae(HR = 0%, n = 14), Scrophulariaceae (in Lamiales)(HR = 8%, n = 25), Rubiaceae (in Gentianales) (HR = 15%,n = 53), Phyllanthaceae (in Malpighiales) (HR = 16%,n = 18), and Malvaceae (HR = 16%, n = 19) demonstrated the lowest HR against the D-R parasites (Fig.3).Plant families with notably different HR against the D-S and D-R parasites were Celastraceae (51% difference), and Loganiaceae (in Gentianales) (29% difference).

Fig.3 Phylogenetic tree of plant families investigated in vitro for their activity against the intra-erythrocytic asexual P. falciparum parasites.The tree was generated using NCBI Taxonomy and processed using iTOL.ND not determined: This applies to plant families with < 10 compounds isolated from them and subsequently evaluated for their antiplasmodial activity.The hit rate (HR) is the % of compounds with an IC50 ≤ 10 μΜ for each plant family

Overall, from this preliminary analysis, the Buxales and Caryophyllales have consistently emerged as the ‘hottest’plant orders.The Simaroubaceae, Buxaceae, and Ancistrocladaceae have emerged as the ‘hottest’ plant families,routinely displaying high HR against both the D-S and D-RPlasmodiumparasites (Fig.4).

Fig.4 Launched drug chemical space of the ‘legacy’ antimalarials and natural product compounds isolated from the ‘hot’ plant orders and families.The online Python library for chemical space visualization ChemPlot [33], was used to launch the chemical space of the natural compounds and ‘legacy’ antimalarials

2.3 Antiplasmodial activity and cytotoxicity of compounds isolated from different plant orders

To further assess the productivity of the different plant orders and families, we expanded our analysis by determining the % of compounds, in each plant order and family, classified as either highly active (HA) (IC50≤ 1 μM),moderately active (MA) (10 μM ≥ IC50> 1 μM) or poorly active (PA) (IC50> 10 μM) (Table 3).We anticipate that research that prioritises plant orders or families that produce mainly HA compounds, will yield more rewarding leads.Furthermore, considering the need to address the resistance phenomenon, we examined the resistance index (RI) of compounds as an indicator of their effi-cacy against the D-R strains relative to D-SPlasmodiumparasite strains.Additionally, as a proxy indicator for the preference of compounds to compromisePlasmodiumparasite proliferation ahead of that of mammalian cell lines, we assessed the selectivity index (SI) of the compounds investigated per plant order and family.We consider that plant orders and families producing compounds with an RI ≤ 10 and an SI ≥ 10 should be preferentially prioritised for investigation (as per guidelines provided by the Medicines for Malaria Venture (MMV)—https:// www.mmv.org/).

Table 3 Antiplasmodial activity and cytotoxicity of compounds isolated from plants of different orders

This analysis showed that many of the compounds isolated from plant species in different plant orders were found to be PA.Exceptions to this were the plant orders Caryophyllales (n = 53), with 55% of its compounds found to be HA againstP.falciparumD-S strains (Table 3).Similarly, against D-R strains, many compounds (45%)from the Caryophyllales (n = 62) were classified as HA.Likewise, most of compounds (46 to 75%) from the orders Chloranthales (n = 44), Buxales (n = 11) and Celastrales (n = 12) were classified as being HA against the D-R strains.Despite receiving considerable attention,the majority (ranging from 61 to 80%) of the compounds isolated from orders Asterales (n = 151), Gentianales(n = 192), Lamiales (n = 56), Sapindales (n = 168), Malpighiales (n = 86), Fabales (n = 138), and Magnoliales(n = 91) were classified as PA against D-SP.falciparumstrains.This pattern was noted for the same orders,namely Asterales (n = 145), Gentianales (n = 190), Lamiales (n = 98), Sapindales (n = 287), Malpighiales (n = 186),Fabales (177), and Magnoliales (n = 117), against D-R strains with most of the compounds (ranging from 49 to 78%) being classified as PA.Generally, most of the compounds in all plant orders demonstrated an acceptable RI, i.e., ≤ 10.In addition to having most of their compounds classified as either HA and MA, remarkably,many molecules (> 70%) from the Buxales (n = 31), Chloranthales (n = 13), and Caryophyllales (n = 31) showed a good SI, i.e., ≥ 10 (Table 3).

Consistent with observations for the plant orders,most compounds isolated from most plant families were classified as PA (Table 4).In variance with this generalisation were the Ancistrocladaceae (n = 42) and Simaroubaceae (n = 19), from which most compounds (52 and 74%, respectively) were classified as HA against the D-SP.falciparumstrains.The productivity of the Ancistrocladaceae (n = 45) and Simaroubaceae (n = 55) was retained against D-R strains with 49 and 62% of compounds from each respective plant family being classified as HA.Further plant families with most compounds(ranging from 45 to 75%) classified as HA against D-R were the Buxaceae (n = 11), Chloranthaceae (in Chloranthales) (n = 44), Celastraceae (in Celastrales) (n = 12),and Loganiaceae (in Gentianales) (41).Intriguingly, most compounds (63%) from the Loganiaceae (n = 84), were classified as PA against the D-S parasites.Despite receiving considerable research interest (as observed by the number of compounds isolated from them and screened for their antiplasmodial activity), many compounds isolated from the families Fabaceae (in Fabales), Rutaceae(in Sapindales), Rubiaceae (in Gentianales), Annonaceae(in Magnoliales) and Asteraceae (in Asterales) were classified as PA against both D-S and D-RP.falciparumparasites (Table 4).

Table 4 Activity and cytotoxicity of compounds isolated from plant species in respective plant families

The resistance index for all plant families is good since > 90% of compounds have an RI < 10.Complementing the good activity profile of most compounds in the Ancistrocladaceae (n = 43), Buxaceae (n = 31), and Chloranthaceae (n = 17) was a good SI for most of them (> 60%)(Table 4).

2.4 Drug-likeness assessment of compounds produced by different higher plant orders and families

Potent antiplasmodial compounds should have good drug-like properties for ease of development into orally available preclinical and clinical candidates with reduced attrition rates in clinical trials.Drug-like properties include physicochemical descriptors including for example molecular weight (MW), consensus LogP(cLogP), hydrogen bond donors (HBD) and acceptors(HBA).Preference for drug discovery should be given to plant orders and families that produce compounds with good drug-like properties.To assess the druglikeness of compounds isolated from the different plant orders and families, in silico calculated molecular andphysicochemical descriptors of compounds were evaluated using different sets of criteria and by utilising data of clinically available antimalarial drugs (Tables 6 and 7).The analysis showed that a significant portion of these compound descriptors agreed with those of the criteria outlined by the Medicines for Malaria Venture,Lipinski’s Rule of 5 [27], Veber’s rules [28] and Ghose filters [29] indicating good characteristics of druglikeness.However, some compounds isolated from the Buxales, Chloranthales, and Caryophyllales did not wholly fulfil the set criteria (Table 5).These discrepancies were noted for the respective families, which included the Buxaceae, Chloranthaceae and Ancistrocladaceae (Table 6).Out of the seven obtained physicochemical descriptors for compounds in these plant orders and families, some molecules did not fall within the specified set criteria for MW, the HBA, molar refractivity (MR), and cLogP.

Table 5 Calculated mean physicochemical descriptors for compounds isolated from different plant orders*

Table 6 Calculated mean physicochemical descriptors for compounds isolated from different plant families*

Table 7 Ranking of plant orders for antimalarial drug discovery

Table 8 Ranking of plant families for antimalarial drug discovery

Further evaluation showed average descriptor values of many of the compounds from most plant families and orders compared well with those of approved antimalarial drugs.Similarly, not all antimalarial drug descriptors fell within the criteria for drug-likeness.For example,lumefantrine has a MW of 528.9, a MR of 152.6, cLogPof 7.9 and doxycycline and tetracycline both have 6 HBD and a TPSA of 181.6 A2.

Encouragingly, compounds from most of the plant orders and families were shown to be devoid of the pan-assay interference compounds (PAINS) substructures.The synthesis accessibility (SA) of many compounds from most plant orders and families was consistent with much of the currently available antimalarials.Notable exceptions were the compounds in the orders Buxales, Chloranthales, and Caryophyllales and the families Buxaceae, Chloranthaceae and Ancistrocladaceae.Their SA (ranging from 5 to 7.3) was in the same range as those of the artemisinin derivatives (6.5 to 6.7) (Tables 5 and 6).Most appealing will be plant orders and families which produce compounds which with a low SA value are easy to synthesise, exampled by the quinolines.

2.5 Overall ranking to identify ‘hot’ higher plant orders and families for prioritisation in drug discovery projects

We formulated a rational ranking system which we used to identify the ‘hottest’ plant orders and families.We awarded points to different plant orders and families depending on how they performed in three attributes:the hit rate, HA for both D-S and D-R parasites, and the SI.We decided against including RI and drug-likeness in the ranking system as most of the compounds from most plant orders and families largely complied with set criteria for these indicators, so markedly reducing the value of these characteristics in improving selection resolution.Points were sequentially awarded based on the position of the taxonomic group in performance relative to other groups.For example, 1 point was given, per performance indicator, to the order or family with the highest hit rate,HA, and SI.The second, and third-positioned taxonomic groups were given 2 and 3 points, respectively.This scoring was repeated in that sequence until points were assigned to all plant families and orders.Thereafter, the total number of points was calculated for each family and order.We finally rationalised the number of points based on the number of indicators scored per taxonomic group resulting in the normalised points.This was done to ‘level’ the system, as not all taxonomic groups were scored for all indicators.We then ranked the taxonomic groups based on the number of points with the family or plant order, with the lowest number of normalised points ranking 1st and that with the greatest number of points ranking last (Tables 7 and 8 for orders and families respectively).

Having adopted this ranking system, the following results were obtained and found to be consistent with earlier observations: the top 3 ranked orders were the (i)Caryophyllales, (ii) Buxales, and (iii) Chloranthales.The top 3 ranked plant families were the (i) Ancistrocladaceae (in Ancistrocladaceae), (ii) Simaroubaceae (in Sapindales), and (iii) Buxaceae (in Buxaceae) (Tables 7 and 8).The most prominent natural product classes found to be active (IC50≤ 10 μM) per each plant order and family were isoquinoline alkaloids and naphthoquinones (Caryophyllales and Ancistrocladaceae), steroid alkaloids and lupane triterpenoids (Buxales and Buxaceae), quassinoids(Simaroubaceae) and cycloeudesmane sesquiterpenoids(Chloranthales) (Table 9).We point out that majority of compounds classified by NPClassifier are manually classified as napthylisoquinoline (NIQs) alkaloids in their respective research publications, most of which emanate from the research group of Professor G.Bringmann [30].Encouragingly, the HA compounds from these top plant taxa are structurally different to the ‘legacy’ antimalarial drugs [31].

Table 9 Natural compound classification of active compounds in ‘hot’ plant orders and families#

The pressing need to discover and develop new antimalarials to mitigate drug resistance led us to consider the use of phylogenetic analysis coupled with bioactivity correlation to establish ‘hot’ plant orders and families worthy of prioritisation for antimalarial drug discovery projects.This endeavour has culminated in the identification of 3 ‘hot’ plant orders and families.One of the most intriguing findings of the current study was that the most promising plant orders and families are those from which either no antimalarial drug has previously been isolated or they are generally less studied (judging by either number of compounds isolated from them or number of publications reporting their investigation).In contrast,the families Asteraceae and Rubiaceae have received significant interest in their antiplasmodial evaluation.This interest we believe is driven by two factors, (i) previous discovery of antimalaria drugs from these families and (ii)their extensive use traditionally for malaria treatment as documented by ethnobotanical studies (e.g., [34, 35], and[36]).However, with a combined total ofca.400 compounds isolated from the Asteraceae and Rubiaceae plant families, it is striking to note that only 16 of them (ca.4%) have demonstrated IC50values ≤ 1 μM either against D-S or D-RPlasmodiumparasites.In contrast, of the less investigated plant families Simaroubaceae, Ancistrocladaceae, and Buxaceae, with a combined total of only 86 compounds isolated and screened for activity against the D-SPlasmodiumstrains, 45 of these compounds(53%) were reported to show IC50values of ≤ 1 μM.Furthermore, the compounds isolated from these 3 families and those from other ‘hot’ taxonomic groups show good selectivity, outperforming the Asteraceae and Rubiaceae families (see Table 4).

From a medicinal chemistry perspective, emphasis is placed on the discovery of new compounds that occupy a chemical space that is different from that of the current clinically available antimalarial drugs.This chemical diversity brings with it a high likelihood of targeting novel biological space [37].Having a unique target, compared to current antimalarials, increases the chances of the new scaffolds being potent against clinically drugresistantPlasmodiumstrains.Our analysis shows that the HA compounds isolated from the ‘hot’ plant orders and families identified from the study occupy a differentchemical space to current antimalarials providing further impetus to explore these taxonomic groups.For example,the most prolific compounds from the Simaroubaceae are classed as quassinoids, which the parasite has clinically not been exposed to [38].Similarly, from the Buxaceae family, the most prolific compounds are steroidal alkaloids, which are chemically distinct from any of the clinically available curative malaria drugs [39].

She agreed to see him again, writing her name and phone number in the perfect script she d been taught in third grade, her teacher an ex-nun who had engraved42 the rules of penmanship in her small charges

The quassinoids class of natural products, which the Simarobuceae is well-established to produce, has also demonstrated in vivo antiplasmodial activity albeit with some level of toxicity noted.For example, the quassinoid bruceine B was shown to have an ED90of 2.82 mg/kg/day.At a concentration threefold the ED90, bruceine B was observed to be 100% lethal against the mice used in the study [38].Nonetheless, this compound was shown to be less toxic than other quassinoids, so raising hope that through medicinal chemistry less toxic but highly potent scaffolds from this natural product class could be synthesised.Notably, the synthesis of potent yet less in vivo toxic quassinoid analogues has been successfully undertaken for cancer studies [40].NIQs have similarly shown exceptional in vivo activity.The NIQ dioncophylline C curedP.bhergei-infected mice following a single oral dose (50 mg/kg/day) with no observed toxicity [41].While NIQs are structurally highly complex, approaches to their synthesis have been developed and comprehensively outlined [42] by the group of Professor G.Bringmann [30].Moreover, simplified analogues of this class of compounds have been synthesised and proved to be potent against intra-erythrocytic asexualPlasmodiumparasites [43].Their clinical efficacy remains to be demonstrated.

It is interesting to note the generally high HR of most natural products isolated from many plant orders and families discussed in this study.These HR were substantially higher than those observed for synthetic compounds which have been described to be as low as 0.3%and 0.05% in some studies [44, 45].However, caution needs to be exercised when considering these high hit rates for natural products.Firstly, the bioassay-guided assay approach is a popular approach used to isolate compounds from plants, where guidance is based on the observed bioactivity resulting in the isolation of bioactive molecules, albeit with varying potency.Secondly, the cutoff point (IC50≤ 10 μM) for the HR outlined in this paper is noted to be more tolerant than that used elsewhere,e.g., < 1.25 μM and ≤ 2 μM adopted by Plouffe et al.[45]and Dechering et al.[44], respectively.Nevertheless, the high HR is still encouraging, motivating for the continued investigation of plant-derived compounds to treat malaria.

In conclusion, given the need to accelerate antimalarial drug discovery, plants are a promising oasis deserving of continued investigation in this endeavor.Our study has shown that understudied plant orders and families are more deserving of intensified investigation in search of novel antimalarial drugs.We anticipate these findings will help direct researchers to focus and streamline their investigations on the few plant orders and families most likely to result in the discovery of highly active antiplasmodial compounds that can be channeled into medicinal chemistry programs.

3 Material and methods

3.1 Literature search

To identify published manuscripts for exploration in our study, we queried the PubMed database (https:// pubmed.ncbi.nlm.nih.gov/) [46] searching for publications documenting antiplasmodial properties of compounds isolated from plants.The key phrase used was “Plasmodium falciparumand natural product” limiting the “Text Availability’ option to ‘Abstract”.The search was restricted to manuscripts published between 1964 and 2021.We then manually systematically screened the publications applying the following exclusion criteria.

i) Manuscripts documenting only antiplasmodial activity of compounds isolated from other natural sources,e.g., microorganisms, marine organisms, other than vascular (i.e., higher) plants were discarded.

ii) Manuscripts in which no compounds were isolated and screened were disregarded.

iii) Manuscripts in which only in vivo studies were carried out were excluded.

iv) Duplicate articles were excluded.

Articles subsequently remaining following the above process, were selected for the study.

3.2 Taxonomic terminology

From the selected manuscripts, we manually collated compounds reported along with relevant pharmacological data, and species-of-origin saving this information on a Microsoft Excel spreadsheet.The captured data was verified 3 times to ensure all details collected were accurate and consistent with current nomenclature and taxonomy.Given disparities in taxon circumscriptions and related nomenclature inherent at all taxon ranks, we harmonised our systematic approach through aligning with the World Flora Online (WFO) database (http:// www.world flora online.org/) as of 7 November 2022, including in the assignment of species authors.In instances where the species authors for taxa had not been provided in the source pharmacological publications it was occasionally necessary, when more than one identical basionym exists,to resolve the identity of the research subject through consideration of the reported plant collection locality relative to data provided in the Global Biodiversity Information Facility (GBIF) (https:// www.gbif.org/).

3.3 Phylogenetic tree generation and data analysis

The phylogenetic trees were constructed as follows.Firstly, Text (.txt) files with the names of plant orders and families (as per WFO) were added onto the NCBI Taxonomic Browser ‘Common Tree’ [24].The resulting tree was saved as a ‘Phyllip file’ (.phy) which was graphically displayed and manipulated using the iTOL online tool(v5) [25].Here, default settings were used with only the following modifications made; Branch lengths—‘Ignore’and Scaling factors—‘0.5 × horiz.’.

The SMILES of the compounds collated were either collected from databases including PubChem [47] and ChemSpider [48] or were generated from 2D structures drawn on ChemDraw Ultra (v8) [49].Using the generated SMILES, compounds were classified into specific classes using the NPClassifier tool [32].To evaluate drug-likeness we computed average physicochemical properties using the SwissADME online suite software [50].Analysis, including calculation of mean values for RI, SI etc.,was carried out using Microsoft Excel.Chemical space analysis was carried out on ChemPlot using structural similarity, PCA algorithm and scatter plot type options[33].

Acknowledgements

This research work was supported by a grant from the Department of Science and Innovation (DSI) of South Africa awarded to Vinesh Μaharaj.The UP ISΜC acknowledges the South African Μedical Research Council as Collaborating Centre for Μalaria Research.Phanankosi Μoyo was supported by a grant from the University of Pretoria Postgraduate Research Support Bursary for his Postdoctoral Fellowship.Sephora Μianda Μutombo was supported by funds from the University of Pretoria Postgraduate Research Support Bursary, South Africa and the L’Oréal-UNESCO for Woman in Science grant.Luke Invernizzi was supported by funds from the doctoral funds from National Research Foundation of South Africa.

Author Contribution

Conceptualisation of study, PΜ and VJΜ.; Collecting data sets, PΜ, LI, SΜΜ,WR, and AWA.; Formal analysis, PΜ, AWA, WR, ΜW and NRC.; Writing—original draft, PΜ.; Writing—review and editing, PΜ, VJΜ, LI, SΜΜ, AWA, ΜW, and NRC.Supervision and funds acquisition VJΜ.; All authors read and approved the final manuscript.

Funding

Foundation L’Oréal (4500453975).

Availability of data and materials

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Declarations

Competing interests

ΜW is a co-founder of Ometa Labs LLC.

Received: 3 August 2023 Accepted: 3 September 2023