What is the root of the problem?
From a holistic point of view the debate covers different levels. First, the discrimination of dead or alive microorganisms represents a crucial problem in (environmental) bacteriology. This basic problem has existed for decades and has not yet been solved. In this respect, the terms “vitality” and “viability” are often used and quite often mixed - some researchers completely interchange these terms [
9].
Second, “vital stains” are generally only surrogates, but are quick and simple devices in studies examining, for example, the antibacterial effect of substances. Here the problem is the large variety of staining substances and thus of staining principles, so that the
“plethora of choices adds to confusion”[
10]
. Similarly Pamp et al. [
11] state:
“More recently developed stains, such as the Syto stains …. can efficiently stain cells in virtually any color of the rainbow”. That might sound humorous – but merely reflects the problem. As just mentioned, Tawakoli et al. [
7] used combinations of several stains (for example FDA, cFDA, TCFDA, EB, as well as SYTO 9/PI, Sytox red, besides Calcein AM). Davey [
10] also refers to FDA, PI and SYTO 9, but moreover to substances like SYTO green I, DIBAC4, pyronine Y, rhodamine 123 and thiazole orange. It is no wonder that this author [
10] refers to another four reviews only to inform about the
“modus operandi” of the different fluorescent stains and the
“huge diversity of possibilities in terms of stain selection, concentration, staining time, etc”.
When analyzing further, the problem becomes even more complex. Some authors criticize the limited use of propidium iodide (PI) as a cell viability
(sic!) indicator [
12,
13], while others monitored striking differences between SYTO 9 and SYTO 12 regarding the influence of porins on uptake kinetics of these dyes [
14]. This means that when antibacterial substances disturbing the cell membrane integrity are assessed the use of those vital staining techniques may be misleading. An inherent aspect of this problem, namely the suitability of staining methods, is the dependency on the stains’ concentrations of the results (see later).
Third, the users are for the most part unfortunately unaware that such “seductive” tests have only been validated for a very limited number of bacterial species [
13,
15]. From 15 000 “hits” (250 reviews) generated in PubMed by asking for “flow cytometry & bacteria”, only three were left after filtration of the database when using “biofilm” as an additional tracing term. None of this three articles contributes to the debate.
It is crucial that “vital stains” and, much more important, their combinations are directly compared to conventional bacteriological data. As discussed later in detail this cannot be the assessment of CFU, but of PE. In dentistry some corresponding work has been completed using single or a lucid number of species
in vitro without conducting bacteriological tests [
6,
16‐
18], while some companies used this staining method trusting
per se in its reliability [
19,
20]. When the SYTO 9/PI combination was in fact related to corresponding microbiological assessments the outcome was inconsistent [
21‐
23]. In our opinion it is impossible to find out studies where particular examples of these vital stains and its combinations were properly compared and related to microbiological data concerning natural multispecies biofilms like dental plaque.
Finally, the dyes may be or are toxic or mutagenic. With respect to the list of compounds, as previously mentioned and excerpted from [
7] and [
10], it must be determined whether there is a harm or risk in the use of these substances.
Viability versus Vitality
Netuschil [
8] recorded 49 terms to describe “vitality states” of microorganisms (for example: active microbes, cryptic growth, direct viable count [DVC], progressive dormancy, vegetative dormancy, dwarf cells, moribund cells, nonculturability, nonplateable, stasis survival, reproductive viability, viable but not culturable [VBNC], non-viable but resuscitable, vital, viviform, etc.) as cited in 34 different corresponding publications [
1,
2,
4,
24‐
54] (cf. Table
1). While the table displays references from 1962 up to 1998, the debate is older and was already relevant at the turn of the 19
th to the 20
th century [
55‐
60]. One example is the “Great Plate Count Anomaly” [
61,
62] see also [
63]. Even at that time vital stainings were debated to be used as a trial to overcome the shortcomings of culture techniques [
64‐
68]. It seems that the past discussion [
69] was “revitalized” at the turn of this century [
41,
70‐
78]. Of note is that some terms (e.g., dormant, VBNC) are even relevant when referring to probiotic bacteria [
79,
80].
Table 1
Terms used to describe “vitality states” of microorganisms (from[
8]
)
| |
| |
| “Shut down cells”, “shut down state” [ 17] |
“Anabiotic (dormant) state” [ 15] | |
“Bags of enzymes” [ 4, 12] | |
| |
Culturable, culturability [ 4, 10, 11, 31] | “Substrate accelerated death”, “substrate |
– Nonculturable, nonculturability [ 22, 24] | |
| |
| |
| |
| |
| |
Viability [ 3, 4, 6, 9, 12, 13, 15, 16],[ 19, 22, 23, 26, 29] |
| – “Apparent viability” [ 6, 15] |
– “Progressive dormancy” [ 1, 30] | – “Reproductive viability” [ 23] |
– “Vegetative dormancy” [ 15] | |
Dwarf cells, inactive dwarfs, ultramicrobacteria [ 6, 14, 15, 29, 32] | |
| – VBNC hypothesis [ 2‐ 4, 7] |
| |
| – “Viable but nonrecoverable” [ 31] |
| – “Non-viable but resuscitable” [ 16] |
| – “Unculturable but viable” [ 28] |
| |
| |
References:
| |
1
Barcina et al. 1989 [ 24] |
18
Kogure et al. 1979 [ 40] |
|
19
Korber et al. 1996 [ 41] |
3
Bogosian et al. 1996 [ 26] | |
4
Bogosian et al. 1998 [ 27] | |
5
Bowden & Hamilton 1998 [ 28] |
22
Morgan et al. 1993 [ 43] |
6
Button et al. 1993 [ 29] |
23
Nebe-von Caron et al. 1998 [ 44] |
|
24
Nilsson et al. 1991 [ 45] |
8
Colwell et al. 1985[ 31] | |
9
Dawe & Penrose 1978[ 32] | |
10
Duncan et al. 1994[ 33] |
27
Postgate & Hunter 1962[ 48] |
11
Gonzáles et al. 1992[ 34] | |
12
Gribbon & Barer 1995 [ 35] |
29
Roszak & Colwell 1987a [ 2] |
|
30
Roszak & Colwell 1987b [ 50] |
|
31
Roszak et al. 1984 [ 51] |
15
Kaprelyants et al. 1993 [ 4] | |
|
33
Whitesides & Oliver 1997[ 53] |
|
34
Wilson & Lindow 1992 [ 54] |
Unfortunately, several of the terms found and used in publications are incorrect, misleading or even paradoxical, especially the often used term “viable but not culturable (VBNC)”, which blurs the line between vitality and viability. To minimize confusion as much as possible we refer to Kaprelyants et al. [
4]:
“Several classifications of the physiological states of microorganisms have been presented. We have previously suggested[
3]
that all the cell types considered could be reduced to three groups, as follows: ‘viable’ to refer to a cell which can form a colony on an agar plate, ‘vital’ to refer to one which can only do so after resuscitation and ‘non-viable’ to refer to a cell which cannot do so under any tested condition. According to this terminology, dormant cells are vital” (see Table
2).
Table 2
“Glossary of terms used to describe the 3 major physiological states defined herein” (cited from[
3]
)
Viable
|
Capable of division; will form a colony on an agar plate.
|
Vital or dormant
|
Unable to divide or to form a colony on an agar plate without a preceding resuscitation phase.
|
Non-viable
|
Incapable of division; will not form a colony on an agar plate under any tested condition.
|
In accordance with [
4] we define “viable” strictly as “capable to grow”. In this respect any other tests, for example elongation tests (DVC; [
50]) or staining methods, are merely proxies, since no kind of staining can prove viability. Thus, the term “viability stain” is a misnomer
per definitionem and these stains should correctly be named “vital stains”. Unfortunately, the misnomer is frequently used by Invitrogen Ltd. (BacLight™), and is consequently – but incorrectly – adopted by the users of these vitality tests.
The BacLight™ bacterial viability kit (BacLight Assay, BLA)
According to the manufacturer [
81] BLA consists of two stains, propidium iodide (PI) and SYTO 9, which both stain nucleic acids. SYTO9 is a green fluorescing intercalating membrane permeable molecule and stains all cells. In contrast, PI is a red intercalating stain and is membrane impermeable, and is therefore excluded by “healthy” cells. The manufacturer describes that PI has a stronger affinity to nucleic acids than SYTO 9; thus, when both stains are present within a cell, SYTO 9 will be displaced from nucleic acids and the cell(s) will fluoresce in red. To prove the mechanism Stocks [
82] conducted “cell-free” physicochemical measurements with the “Viability Stain,
Bac Light”, and could reveal that the staining principle is not that simple. This author established a so-called fluorescence resonance energy transfer (FRET), though, under certain staining conditions, SYTO 9 emission surpasses the PI emission. Increasing the SYTO 9 concentration may thus enhance the PI emission, causing a double staining of cells. Stocks [
82] emphasizes several times that for an interpretation of the staining outcome
“the relative concentrations of PI, SYTO9 and DNA were of crucial importance” and
“that appropriate control or validation experiments (should be) performed”. Double staining and/or FRET was also documented by other authors [
41,
83], who examined viability parameters of viable and formaldehyde-killed or UVA-treated cultures, respectively. The following discussion of the MOLECULAR PROBES manual should be viewed in this context.
Concerning the reliability of the “viability kit” used in biofilm research we would like to directly cite the product information sheet(s) of MOLECULAR PROBES 2001; Product Information LIVE/DEAD ® BacLight™ Bacterial Viability Kit, Revised: 26-January-2001, as well as Revised: 15-July-2004 (whereby the latter represents the most current version in October 2013) [
81]:
(I)
“… stains differ both in their spectral characteristics and their ability to penetrate healthy bacterial cells. When used alone, the SYTO 9 stain generally labels all bacteria in a population – those with intact membranes and those with damaged membranes. In contrast propidium iodide penetrates only bacteria with damaged membranes, causing a reduction in the SYTO 9 stain when both dyes are present. Thus, with an appropriate mixture of the SYTO 9 and propidium iodide stains, bacteria with intact cell membranes stain fluorescent green, whereas bacteria with damaged membranes stain fluorescent red."
(II)
“Staining Bacteria with either Kit L7007 or L7012 - 4.1 Adjust the E. coli suspensions (live and killed) to 1 × 10
8
bacteria/mL or the S. aureus suspensions (live and killed) to 1 × 10
7
bacteria/mL. S. aureus suspensions typically should be 10-fold less concentrated than E. coli for fluorescence microscopy.” Hence the numbers of E. coli and S. aureus differ by one logarithm.
(III)
Lastly, due to (I) a mixture of 50% living and 50% dead bacteria does not normally lead to a 50/50 green/red fluorescence. And vice versa: 50% of green fluorescing bacteria in a sample does not mean that there are 50% vital cells. Therefore, “green/red fluorescence ratios (have to be) calculated for each proportion of live/dead E. coli.”
This last point was confirmed by Hannig et al. [
6] in their paper concerning the viability of
S. mutans in vitro. They determined that an
“initial concentration of viable bacteria in the assay” of 50% leads to different
“ratio emission vital/emission dead bacteria” values of about 9.5, 6.5 or 8.0 in their NaCl-control samples. Furthermore, Hannig et al. [
6] state that
“the proportion of avital bacteria increased as indicated by the ratio of avital to vital cells. It ranged between 0.1… and … 29.0 … After rinsing with chlorhexidine, the ratio amounted to 190 (6 h) or 10.2 (12 h)…” Taking the information of their Figure one into account, it remains unclear what these different ratios actually may mean in terms of “real” vitality values.
In summary, the three above-mentioned points described in the BLA user’s manual show that, in accordance with Stocks [
82], an “
appropriate mixture” of the two stains must be determined and established for any single species of bacteria before using them in an experiment. This may be possible in an
in vitro situation, where the BLA may help to save time in the long run following critical, careful, and time consuming calibration steps for each individual bacterial species. However, this is impossible to manage in naturally occurring biofilms due to the uniqueness of the plaque matrix biofilms where in reality there may be more or less than 1000 different microbial species [
84]. Moreover the application of the SYTO 9/PI stain is not considered suitable for biofilms, because of diffusion phenomena due to exo-polymers that result
“in an underestimation of viable counts”[
21].
However, there seems to be an additional drawback. Giertsen et al. [
23] criticized considerable discrepancies between expected biofilm vitality and the outcome of the SYTO 9/PI staining method. Moreover, these authors described an instable behavior and a change of the staining color from green to red,
i.e. towards monitoring more “dead” bacteria. Just recently these findings were explained by Tawakoli et al. [
7] postulating that stained cells lost their viability shortly after intercalation of the dyes, writing:
“This hypothesis was confirmed by the TEM analysis in the current study (Figure two). The images showed immense lysis and destruction of the adherent cells ….”
Table
3 lists a plethora of studies [
5‐
7,
9,
16‐
20,
22,
23,
41,
71,
73],[
82,
83,
85‐
98] that used the BLA. However, in almost all studies it had to cleared up (a) if plaque-like biofilms or, at least, saliva samples were evaluated, or if the investigations were made with single bacterial species and/or artificial (monospecies) biofilms; (b) if the staining regime of the biofilm samples was adjusted according to the BLA manual (validation); (c) which dilution factor was used between SYTO 9 and PI, due to a validation procedure according to (b); (d) how the incubation procedure was followed, especially the incubation time of the bacterial samples together with the stains’ mixture; (e) whether the results of the BLA were compared to other parameters, and if the latter were appropriate (g) or not (f); (h) if the BLA results fit with the other parameters (whether appropriate or not). Last not least we were interested (i) whether the BLA results met the expectations of the users.
Table 3
Background information concerning the use of the BacLight® assay (BLA) for assessment of (dental) biofilm vitality
Studies using the BLA (n = 30) | |
(a) Were plaque-like biofilms evaluated? | No: [ 5, 16‐ 18, 22, 23, 41, 82, 83, 86],[ 87, 93, 96‐ 98] |
Yes: [ 6, 7, 9, 19, 20, 71, 73, 85],[ 88‐ 92, 94, 95] |
(b) Validation | No: [ 5, 7, 9, 16‐ 18, 20, 22, 23, 41],[ 71, 73, 82, 85‐ 91, 94‐ 98] |
Yes: [6, 19?, 83, 92, 93?] |
(c) Dilution factor | |
1:1 [ 6, 7, 9, 17, 20, 22, 41, 71],[ 85, 87‐ 92, 96, 97] |
|
|
|
|
(d) Incubation procedure | |
|
15 min, RT [ 5, 9, 17, 19, 22, 23, 41, 73],[ 85, 87‐ 91, 97, 98] |
|
|
|
(e) Comparison | No: [ 16‐ 18, 20, 22, 71, 82, 83, 85, 88],[ 90, 91, 94, 95] |
Yes: [ 5‐ 7, 9, 19, 23, 41, 73, 86, 87],[ 89, 92, 93, 96‐ 98] |
(f) … with inappropriate methods | [ 6, 7, 9, 19, 23, 41, 73, 86],[ 87, 89, 92, 93, 96‐ 98] |
(g) … with appropriate methods | |
(h) Did the BLA results fit to the other parameters? | |
|
(i) Did the BLA results meet the expectations of the user(s)? | |
|
From the 30 investigations listed in Table
3 one half (15) was classified in the rubric “plaque-like biofilm”. This high portion is due to the fact that we endeavored to consider literature concerning oral biofilms. Natural saliva was also included
e.g.[
89‐
91] as well as “microcosm plaques”, which were grown in an artificial mouth and/or were for example established from saliva [
20,
71,
92] or from a subgingival plaque sample [
96]. Some other studies dealt with deep-sea sediment bacteria or wastewater samples [
73,
93], which were considered by us as natural multispecies systems.
It is astonishing, but expected, that only five studies were based on a preceding validation procedure (line (b) in Table
3) [
6,
19,
83,
92,
93], from which two cases were even questionable [
19,
93]. Nevertheless, a calibration could be assumed there. The description of Filoche et al. [
92] clarifies the laborious methodology (cf. their Materials & Methods section, paragraphs 2.5 Generation of the viability standard; 2.6 Preparation of the individual plaque viability standard, 2.7 Preparation of the pooled viability standard; 2.8 Staining protocol for Live/Dead® BacLight™ and 2.9 Fluorescence measurement and data analysis). Surprisingly, the calibration procedure of these authors [
92] even seemed to work when samples and controls were fixed in 4% paraformaldehyde, and/or were stored for up to three months.
As a consequence of having no validation the utmost users rely in the manufacturer’s advices regarding the dilution factor between the two stains SYTO 9 and PI. Only four research groups did not follow the recommended 1:1 dilution. Interestingly, their factors span a range from 1:6 up to 4:1 (line (c) in Table
3). This generally mirrors the seductive nature of the staining procedure [
13] and the requests the researchers have towards an easy and quick application. The concern of Stocks [
82] that the relative concentrations of PI, SYTO9 and the nucleic acids are of crucial importance is mostly neglected by the users.
At first glance the same holds true for the incubation procedure (line (d) in Table
3), however, this might be a more severe problem. Twenty-two of the users did not state the procedure or relied on the manufacturer’s instructions. Some others reduced the recommended 15-minute incubation to 10 minutes [
6,
7,
20], while others extended the incubation time between the BacLight stains and their samples to 20 or even 30 minutes [
83,
92‐
94]. However, the finding of Tawakoli et al. [
7] that the stained cells under investigation changed or even lost their viability shortly after intercalation of the dyes suggests that the “simple” incubation time is a crucial and potentially destructive factor. It is to question whether a time of 10, 20 or 30 minutes (“in the dark”, but at room temperature, and only once at 2°C [
93]) may exert a deteriorating effect on the outcome of the staining procedure. This is of specific importance when the influence of antibacterial substances is assessed like the widely used and often studied chlorhexidine (CHX) or essential oils (EO), which affect the integrity of the bacterial cell membrane.
All three phenomena - FRET and double staining [
41,
82,
83], potential impact of exopolymers [
21], and the observation of decreasing vitality during the staining procedure [
7] - point towards an overestimation of PI,
i.e., of dead cells. For an example Tomás and colleagues assessed the effects of CHX in saliva as measured with the aid of the BLA [
89‐
91]. Especially 30 seconds after rinsing with CHX they revealed a very strong bactericidal action. However, their magnitude was in line with former (independent) conventional plating assays of the same research group [
99]. Noteworthy, this research group had acceptable outcomes with the use of the staining procedures and presented convincing data regarding the antibacterial effect of different CHX concentrations, rinsing regimes [
89,
90] and influencing factors [
91]. In sum, however, it cannot be cleared whether there is an artificial shifting towards “dead” values as long as no concomitant comparison with appropriate conventional parameters is made.
Different trials concerning comparisons were conducted by the BLA users (line (e) in Table
3), with the exception of [
5] altogether with inappropriate methods. In our opinion (see next paragraph) only the plating efficiency (PE) as a relative parameter is appropriate, but not the CFU. Decker [
5] registered the total bacterial counts (BC) as well as the CFU, both parameters being the mathematical basis to calculate the PE [
8]. Nevertheless, she did not determine the corresponding PE values. Therefore, the positive and negative conclusions regarding the reliability of her different staining procedures cannot be justified.
Assessments of CFU were conducted by different authors, either in independent earlier publications [
99‐
101] before the authors switched to the usage of the BLA [
19,
90,
91], or simultaneously with their BLA measurements [
7,
19,
23,
86,
89,
92,
96‐
98]. Quite astonishingly all these very different author groups tried to compare the relative parameter “percentage of vital bacteria”, as monitored by the BLA, with the absolute parameter CFU as assessed by plate counting. No wonder that the counts did not fit with the BLA in 7 of the 9 cases (line (h) in Table
3).
Finally, we tried to judge whether the authors were satisfied with the outcome of the BLA (line (i) in Table
3). This was more or less true in the majority of cases, independent of validation and comparison, and independent of the agreement of the BLA with the other (inappropriate) methods. Some authors were nearly delighted [
92,
93].
Taking all objective observations into consideration, incorrect statements were published by the users of the BLA. Some of the users [
6,
7,
21,
23,
41,
71,
82,
83],[
86] even describe and discuss the shortcomings of their commercial stains. Similarly, as already mentioned, Davey [
10] in her recent review criticizes some limitations or “stumbling blocks” in flow cytometry. No single stain or staining method has been found to be suitable for all organisms [
102]. Consequently, the
modus operandi of different fluorescent stains has even been described in several reviews (see for example [
102‐
104]). The second limiting factor deserving consideration is the need for further method development and protocol adjustment, even when similar protocols have already been published. For example, microorganisms may need pretreatment, which may also be different for gram-positives and gram-negatives, such as the use of EDTA. Thus, such protocol modifications are necessary for each new bacterial species tested (“
strain- and matrix-specific optimization of the protocol”) [
10,
97,
105,
106]. Again, it should be noted that the dental biofilm comprises, in a conventional view, of as many as or even more than 1000 diverse species [
8,
84] embedded in a complex matrix [
8]. A current genetic analysis even discloses 10 000 species-level phylotypes [
107].
Dependence of the results of vitality testing on the stain’s concentration
Stocks [
82] clearly stated that not only the relative concentrations of PI and SYTO9, but also their relationship to DNA are of crucial importance. This equals Acridine Orange (AO), which is falsely named a vital stain, and which stains nucleic acids either green (this was wrongly believed to be vital) or red (this was wrongly believed to be dead). It could easily be shown that dilutions or concentrations of only factor 2 lead to remarkable shifts in the green/red images [
8]. This clarifies that the so-called “vitality values,” as assessed with AO, are dependent on its concentration, on pH and other factors, as well as on the relationship of the dye (whether in an adequate concentration or not) to the actual amount of stainable nucleic acids, DNA and/or RNA. Citations from the MOLECULAR PROBES Product Information [
81], including
(I) that an “
appropriate mixture” of the dyes is necessary for reliable testing, and
(II) that the number of bacteria to be tested has to be known and standardized in a species-specific manner, clarifies that the aforementioned statement concerning AO, in accordance with Stocks [
82], also applies to the SYTO 9/PI stain.
However, no relevant concentration-dependency was found for the FDA/EB staining. Staining solutions containing the same basic concentration of FDA/EB were applied in different studies, where, due to differing study designs, the volumes of the used staining solutions ranged from 5 μl [
112,
113] to 50 μl [
114], and even up to 500 μl [
110,
115], without changing the outcomes of the VF assessments [
8].
The majority of the commercial staining components act via
passive physicochemical distribution patterns, which are assumed to be different in (real) viable and (real) dead microbial cells. This also holds true for EB; however the color of EB cannot change due to concentration, pH or other physicochemical parameters [
8]. In contrast, the non-fluorescent FDA penetrates the cell membranes of living cells, and is cleaved only in a
metabolically active cell by different enzymes, mainly esterases [
8,
116,
117], to yield the fluorescing Fluorescein. Thus, a functioning metabolism is a necessary prerequisite for positive intracellular (vital) staining. Similar to the red EB counter stain, the green Fluorescein staining is neither hampered nor changed by physicochemical effects.