Metabolic fingerprinting of Melastoma malabathricum L. extracts using high-performance liquid chromatography-diode array detector combined with chemometric data analysis

Melastoma malabathricum L. is an essential herb that has been traditionally used to treat several ailments such as diarrhea, dysentery, wound, and stomachache. This plant is common across tropical Indo-Pacific archipelagos distributed in various geographical locations of Indonesia. In this study, an effective, accurate, and efficient highperformance liquid chromatography (HPLC) coupled to diode array detectors was developed to evaluate the effect of geographical variations on secondary metabolites of M. malabathricum samples through establishing a fingerprint profile, identification of variations of chromatogram peaks, and determination of anti-methicillin-resistant Staphylococcus aureus (MRSA) activity. Fingerprinting analysis of the extracts was analyzed using chemometric tools including unsupervised data analysis: principal component analysis (PCA) and hierarchical cluster analysis (HCA). PCA values were identified as principal component 1 (PC1) and principal component 2 with 41.229% and 19.208%, respectively. Anti-MRSA activity of all samples showed significant inhibition against MRSA bacteria. The results indicate the significance of considering geographic distribution during field-collection efforts since they demonstrate regional metabolic variations in secondary metabolites of M. malabathricum L., as illustrated by HPLC and their anti-MRSA activity. The results also confirm the utility of this approach as a comprehensive evaluation of metabolic variations based on their different geographical locations and pharmacological effects.


INTRODUCTION
Melastoma malabathricum L. is an important herb that grows in temperate and moist regions, like Asia and Pacific Islands, Indian Ocean Islands, South and South-East Asia, China, Taiwan, Australia, and the South Pacific Ocean (Joffry et al., 2012). This plant is traditionally used to treat several ailments related to infectious diseases, for example, diarrhea, dysentery, leucorrhoea, hemorrhoids, cuts and wounds, infection during confinement, toothache, stomachache, flatulence, sore legs, and thrush (Begum and Nath, 2000;Bhardwaj and Gakhar, 2005;Joffry et al., 2012). To treat cuts, the leaves of M. malabathricum are chewed up, pounded, and applied as a paste on wounds or cuts to stop bleeding (Joffry et al., 2012;Zakaria et al., 2006). The young leaves are eaten for healing of diarrhea, while the young premature leaves are consumed twice daily to cure dysentery (Sajem and Gosai, 2006). Leaves are also utilized to treat mouth ulcers, gastric ulcers, scars, pimples, and black spots on skin (Lohézic-Le Dévéhat et al., 2002). Combinations of leaves and other parts of M. malabathricum are also useful to treat ailments, such as leaves and roots are used as a digestive aid and applied to wounds and pox scars to aid the healing process, while the combination of leaves and flowers is used in the treatment of cholera and prolonged fever, as an astringent in leucorrhea and in curing chronic diarrhea (Joffry et al., 2012;Perry and Metzger, 1980;Sharma et al., 2001).
The metabolites of M. malabathricum leaves could be affected by several factors such as geographical location, altitude, cultivation area, micro-and macronutrients, pH of soil, climate, harvesting time, and storage method (Ncube et al., 2012). The effect of higher temperature, various precipitation levels, different soil moisture, and fertility might affect the plant's phenology, nutrient, antioxidant, and secondary metabolite levels (Kumar et al., 2017). Chemical constituent variations of plant extracts can be revealed by analytical and chemical techniques including chromatograms, spectrograms, and other graphs (Maimaiti et al., 2020). From these techniques, chemometrics can potentially characterize and identify both the marker components and the unknown components in a complex system (Maimaiti et al., 2020;Toyang and Verpoorte, 2013). Nowadays, assessments of one or two marker compounds or pharmacologically active components in herbal mixtures are currently carried out to assess the quality and authenticity of complex herbal medicines (Feng et al., 2014;Maimaiti et al., 2020).
The application of extraction and analytical methods of all metabolites in the species is challenging due to the complexity of the plant metabolomes (Leme et al., 2014). With the development of analytical techniques, chromatographic fingerprints have been widely used for the authenticity tests or to provide quality control of herbal medicines (Feng et al., 2014;Gan and Ye, 2006). In the past several years, chromatographic fingerprinting or discriminating analysis has been established using high-performance thin layer chromatography, fourier transform infra-red, and ultra performance liquid chromatography (Aziemin et al., 2014;Gu et al., 2017).
High-performance liquid chromatography (HPLC) is one of the most popular methods that is widely used due to its convenience and efficiency (Duan et al., 2012). To the best of our knowledge, there are no previous reports on the development of a fingerprint study of the M. malabathricum herbal profiles to distinguish their geographical locations in archipelago and terrestrial regions. This study will be the first one.
In this study, a simple, sensitive, and reliable HPLC coupled to diode array detector (DAD) methods was employed for establishing the metabolomic fingerprints of 72 samples of M. malabathricum leaves from various geographical locations and assessment for anti-methicillin-resistant Staphylococcus aureus (anti-MRSA) activity. Combined with chemometric tools, including principal component analysis (PCA) and hierarchical cluster analysis (HCA), a method and pattern recognition technique for quality control of M. malabathricum could provide the reference for quality control of traditional medicinal plants such as M. malabathricum.

Collection of plant material
Specimens of M. malabathricum leaves (72 samples) were collected during June 2019 from different localities and host sites covering six geographical zones in terrestrial and archipelago places of Indonesia. Each location contained six sampling spots. The leaves were identified by their morphological characteristics and then compared with previously deposited specimens in the Department of Pharmaceutical Biology, Universitas Gadjah Mada, Indonesia.
Fresh and healthy specimens of leaves of M. malabathricum were collected from individual plants at each location. Samples were placed in sterile plastic bags and were brought to the laboratory in an icebox for analyses. The specimen's code, location, date of collection, province, and coordinate sites are shown in Table 1.

Chemicals and reagents
Chromatographic grade acetonitrile and methanol were purchased from Fisher Scientific Corporation (Loughborough, UK). HPLC-grade trifluoroacetic acid (TFA) acid was purchased from Merck (Darmstadt, Germany). Distilled water was obtained from a local purified water provider.

Preparation of extracts
Air-dried leaves of M. malabathricum L. samples were ground with a pestle and mortar. The powder (100 mg) was homogenized with 5 ml 100% methanol using an ultrasonicator bath (40 kHz) for 30 minutes. After the mixture cooled to room temperature, the extract was filtered through a 0.45 µm membrane filter and 20 µl injected into the HPLC system analysis.

Antimicrobial properties
The microorganism MRSA (American Type Culture Collection 33591) was used for the determination of the antibacterial activity of M. malabathricum extracts. Microbroth dilution assays were used to determine percent inhibition of the samples against MRSA. The concentrations of the extracts used for antibacterial assay were all 50 µg/ml. In brief, 80 µl of broth heart infusion media was loaded in sterile flat-bottomed 96-well plates. A volume of 20 µl of each plant extract was pipetted onto the plate and mixed using a micropipette. The starting bacterial strain suspension was 1 × 10 6 CFU/ml and the wells containing bacterial inoculum without any extract were served as a control. The inoculated plates were incubated at 37°C for 18-24 hours. The experiments were repeated in triplicate. The growth of bacteria was detected by optical density (OD) at 600 nm and percentage bacterial growth inhibition was calculated based on the formula given as follows:

Data analysis
HPLC chromatogram profiles including retention time, area, height, concentration, and percent area were analyzed using Microsoft Excel version 2013. The data were processed by multivariate statistical techniques, namely PCA and HCA. The multivariate data analysis, PCA, and HCA were carried out by PAST3 software (Hammer et al., 2001).

Optimization of HPLC condition
After sample preparation, the determination of chromatographic separation conditions was the significant steps in the plant extract analysis. The analysis was nontargeted analysis; therefore, the main purpose was to obtain a satisfactory separation of all substances contained in tested samples. The untargeted metabolomics would represent the analytical approach for an accurate evaluation of the qualitative changes in bioactive substances. First, an HPLC-DAD metabolic fingerprinting method was optimized in order to detect as many metabolites as possible by varied different factors to increase the number of compounds separated (Alaerts et al., 2007). Various HPLC parameters including the mobile phase (methanol, water, and acetonitrile), detection wavelength (190, 245, 312, and 360 nm), and flow rate (1 ml/minute and 0.5 ml/minutes) were optimized to obtain separation efficiency and good shaped peaks of the chromatogram profiles. A good peak shape, chromatographic resolution, and retention time stability are vital for rapid data mining procedures and alignment within metabolomic analyses (Rubert et al., 2015).
After several different HPLC parameters were tried, a set of optimum HPLC parameters was reached, which consisted of a detection wavelength of 312 nm. This wavelength was chosen after comparing separated peaks and their profiles chromatograms with the number of wavelengths. The selected flow rate was 1 ml/minute that provided the best peak resolution. For the mobile phase, first water and 0.1% TFA in methanol were mixed with a composition of 70:30, and then to obtain the best separation, acetonitrile was added to the mixture solvents. By taking several combinations and compositions of mobile phase into consideration, water : methanol 0.1% TFA (90:10 and 70:30), water : methanol 0.1% TFA:acetonitrile (60:30:10), and water: methanol 0.1% TFA : acetonitrile (65:30:5), it was found that 0.1% TFA in the methanol-water-acetonitrile system gave better resolution and better peak shapes than other systems. TFA (0.1% v/v) was applied as an additive in the methanol, which could facilitate a better solution by reducing peak tailing (Chen et al., 2015). TFA is often used in laboratories to control the peak shape of basic compounds by ensuring that residual silanols ubiquitous in the silica-based C18 columns are in their neutral charge state (McClain and Streckfuss, 2014). The column temperature was set at 30°C.

HPLC fingerprints of extract samples
This study aimed to investigate M. malabathricum secondary metabolites in an untargeted analysis of their genetic diversity based on differences of geographical origin so as to set a framework of pattern-based secondary metabolites or potential uses determination. To accomplish the purpose, methanol extraction combined with HPLC-DAD was utilized for metabolic fingerprinting analysis of 72 samples of M. malabathricum collected from different locations including archipelago and terrestrial regions in Indonesia. Based on fingerprint analysis, the common peak is found present in each extract sample and these peaks should be well separated and have large areas. Extracts were analyzed in a range of UV wavelength of 190-800 nm. Metabolite assignments were based on retention time and peak area, with approximately 45 peaks found in each individual sample. Peaks that existed in all samples with good resolution and reasonable heights were assigned as common peaks to express the characteristics of M. malabathricum chromatogram extracts. The HPLC-DAD qualitative analyses of the representative fingerprint profiles of samples are shown in Figure 1.
Based on Figure 1, there were eight common peaks found in all samples from different geographical locations at retention time: 15.30, 15.60, 21.50, 22.09, 22.55, 24.40, 24.70, and 24.997 minutes. These retention times were found in each area but were different in the intensity of the peak. Sample A [Tanjungpinang (TP)] encompassed a significantly more intense peak with 12 major peaks. Peaks 2, 3, and 4 were likely to be clear compared to the other samples. The prominent peaks for sample A regarding the retention times were observed at 3. 89-5.81, 10.72-13.23, 20.49-22.54, and 24.11-26.76 minutes.
Overall, there was a need to conduct further investigation through more sophisticated methods to identify which metabolites were represented by the aforementioned HPLC data. Sample B [Bintan (BN)] was dominated by peaks 1, 2, 5, 6, 8, 9, 10, 11, and 12, whereas peak 7 constituted a major peak in samples A and C. Peak 8 was predominately shown in samples B, D, E, and F, and these are the samples from the terrestrial region. The peaks that were site-specific were 1, 2, 3, and 4 due to their presence in samples from only several regions, whereas the common peaks were found in peaks 5-8 and 9-12 due to their presence in specimens from all regions.
Peak number 5 (Rt: 15.30 minutes) was chosen as a maximum peak in the middle of the chromatogram. According to the literature, based on retention time and chromatographic profile, peak numbers 1 and 2 (Rt: 5.1 and 8.1 minutes) are likely to be flavonoid glycoside compounds: isoquercitrin and quercitrin, respectively. Metabolites were eluted in a decreasing order of polarity, where the compounds with high polarity appeared first in the chromatogram, followed by low polarity and nonpolar compounds. Differences and similarities among the samples of M. malabathricum extracts were apparent by visual investigations of the HPLC chromatograms.

Principal component analysis (PCA)
PCA is one of the powerful tools in chemometrics that involves unsupervised multivariate methods allowing the reduction of the data matrix, evaluation of samples clustering tendency, and outlier detection based solely on data matric (Massart, 1997). In PCA, no prior knowledge is required for the classification of the samples or to which specific group they belong. In this study, PCA was employed as a holistic approach to analyze HPLC data sets and to explore relative variability within the different extracts based on their geographical variations. It has been reported that PCA is an efficient tool, whereby several principal components (PCs) were extracted to verify the most possible variability based on the multivariate variation data. From 72 samples, peak chromatogram signals extracted from HPLC-DAD were used for PCA modeling. PCA score plots indicated a visual variation among groups. A macroscopic classification appeared in the score plot of the first two PCs, with which samples could be discriminated according to different regions. The model prescribed by principal component 1 (PC1) accounted for 41.229% of the variance, whereas principal component 2 (PC2) explained 19.208%. This classification is shown in Figure 2.
In the score plot, four PCs accounting for 76.94% of the total variance were considered significant. From the scatter points, the samples could be classified into three obvious main groups province. It is worth mentioning that 1em, 1bm, 1bt, and 1Dt from Cluster I were divided into Cluster II, which demonstrated that a closer geographical relationship was found within the TP district than the KR and MR districts. The sample records from the edges of BN district and KM district also confirmed this point. Moreover, compared with those of TP district, the samples from KM district have a larger scattered region, which suggested that the samples from KM district had a greater quality variation than those of the TP district. However, a large drift was found in a few samples, which indicated a remarkable difference in composition or content in comparison with other samples. The results indicated that different origins were the main reason for variations of PC constituents and amount. Moreover, other factors, including climate, soil type, harvesting time, storage condition, and harvesting and processing should also be considered. Geographical differences were very revealing in the results of classification. A simplified form of the data was obtained for the quality assurance of M. malabathricum by evaluating the variations and similarities between the samples.
According to Figure 2, on the left side of the plot, specimens for samples KR, MR, and KS were positioned (negative PC1 values), whereas samples TP and BN were segregated on the right side (positive PC1 values). Samples of the M. malabathricum L. extracts were scattered widely along the right-and left-hand sides of the PC2 and showed tight clustering. Sample KM showed up in sites on the PC2 side having positive PC2 values. It should be noted that, within the terrestrial regions (Riau province), separation based on specimen geographical origin could not be observed as clear in PCA analysis. For example, samples grown in different regions in MR, KS, and KR were all clustered together in one group.

Loading plot
PCA interpretation can be obtained from loading analysis. Loading is the correlation between the original variable and the new variable. Loading provides an indication of which original variables are very significant and influential in the formation of new variables. When the loading is high, the old variables are increasingly influencing the formation of new variables (Sharma, 1996).
The most influential samples of the various samples from different regions were identified through the loading plots (Fig.  3). Examination of the loadings plot of PC1 (Fig. 3A) suggested that chromatogram peak profiles of metabolites detected in the respective peaks at retention times 12. 30, 15.27, 15.59, 24.73, 24.97, and 22.09 minutes were associated positively to PC1. Positive association of variables in PCA revealed that all variables have the same influence and weight value. The peaks at retention times 9. 22, 12.03, 18.56, 19.38, 21.77, 23.51, 22.94, and 26.51 minutes were associated as neutral with little or no influence on the PC1, whereas the peaks at retention times 5. 81, 8.01, 10.25, 11.51, 10.72, 21.09, 21.52, and 24.62 minutes were associated negatively to PC1.

Hierarchical cluster analysis (HCA)
HCA was carried out as a continuation of PCA to assess if, by using a different classification algorithm, it is possible to expect more sensitive sample classification, despite their similarity to each other. With regard to PCA, HCA is an unsupervised multivariate method, which evaluates the clustering tendency of samples through an iterative process, which associates the samples by taking account of the chosen distance between samples and a linkage criterion according to which samples or clusters are merged (Bratchell, 1989). In line with PCA, HCA correctly classified the M. malabathricum samples into three clusters according to different geographical variations, as shown in Figure 4.
HCA showed three clusters referred to as groups, which were Clusters I, II, and III. The close clustering of samples from TP and BN suggested their similarity of metabolite contents in samples involved in Cluster I. The inspection of group II showed that samples from KR, KM, and KS were grouped together, whereas samples from KM were the most distant samples in comparison to others involved in Cluster III. The similarity and group evaluation results showed that location and region differences affected the quality of the extract samples.

Antibacterial activity
The anti-MRSA activity of the samples from different geographic distributions was measured by the dilution methods. The anti-MRSA activity of all samples is presented in Table 2.
The methanolic extract showed inhibition against test bacteria. Antimicrobial results showed that all extracts were active against MRSA. The present findings indicated that M. malabathricum L. may be further explored for the control of infectious diseases. Samples from KR showed the excellent anti-MRSA activity with inhibition activity of 71.63% ± 2.00%, followed by samples from KS, MR, KM, TP, and BN with inhibition of 69.91% ± 4.67%, 66.82% ± 4.01%, 66.41% ± 3.98%, 59.06% ± 7.69%, and 57.36% ± 3.14%, respectively. This finding was attributed to the PCA and HCA analyses. Samples from KR grouped in one cluster (Cluster III) means that it has a specific compound that was different from the others. It had excellent anti-MRSA activity compared to the others that were grouped and clustered close together.
inhibition of more than 50%. The secondary metabolite contents of M. malabathricum L. leaves were dominated by phenolic compounds such as flavonoids, phenolic acid, and tannins, so the activity against resistant bacteria might be due to the contribution of these phenolics. The mechanisms of antibacterial action of phenolic compounds are known to involve many sites of action at the cellular level by modification in permeability of cell membranes, the changes in various intracellular functions induced by hydrogen bonding of the phenolic compounds to enzymes, or the modification of cell wall rigidity (Bouarab-Chibane et al., 2019).

Geographical variation factor analysis and effect on phytochemical composition
The PCA was used to analyze the differences in the geographical location of samples in the archipelago and terrestrial regions in Indonesia. The score plot shows that ecological factors of TP and BN in the archipelago regions were clustered into one group, and those in KS and MR into another group. TP and BN are the regions in the archipelago area with an altitude less than 100 meters above sea level (masl), whereas KS and MR involve terrestrial regions with altitude more than 100-500 masl. However, the KM district belongs to the terrestrial area but it is grouped into one cluster. This difference between geographic variations of growth tremendously affects the phytochemical composition of the herbs. These clustering and grouping by chemometric tools are in line with anti-MRSA properties. The KM district which belongs to the terrestrial area has the highest anti-MRSA activity. According to Rieger et al. (2008), the production of phenolic compounds provided a significant correlation with altitudes, with increasing altitude followed by increasing the phenolic contents. This finding was also in line with studies by Alonso-Amelot et al. (2007) who found higher amounts of radical scavenging phenolics in higher altitudes sites. Meanwhile, Carey and Wink (2019) found that lower elevation samples contained higher amounts of quinolizidine alkaloids. In this study, the higher altitude (KM district) in the terrestrial regions has higher anti-MRSA activity than that of lower altitude in the archipelago regions. This finding is significantly correlated with previous studies. The phytochemical variation could also be affected by other environmental factors including saline environment, type of soil, micronutrients, and macronutrients. The presence of phenolic compounds has generally contributed to the antimicrobial activity. Polyphenols are well documented to have antimicrobial activity against a number of pathogenic bacteria (Abas et al., 2018;Karou et al., 2005).

CONCLUSION
Chromatographic fingerprint analysis has become an effective and comprehensive evaluation method for quality control of plant extracts and for species differentiation. The chromatogram profile with peak characteristic fingerprint peaks was selected to evaluate the secondary metabolites among 72 batches of M. malabathricum by chemometric tools including PCA and HCA. The results clearly demonstrated that the analytical methods of HPLC-DAD provide valuable qualitative information for the quality assessment of M. malabathricum. The PCA indicated good differentiation of samples of the variation of clusters based on PCs. The PCA analysis was grouped into three clusters. At the same time, we tested the crude plant extracts' anti-MRSA activity. These results revealed that all the extract samples were different based on their geographical location and the anti-MRSA activity depended on the secondary metabolites of the plants. These results demonstrated that the combination of HPLC chromatographic fingerprinting and chemometrics offers an efficient and reliable approach for quality evaluation of M. malabathricum extracts based on their geographical variation.