Introduction
Information theory can be defined as a science whose fundamental elements are the quantification, loading and transmission of information. Information theory is studied according to two scenarios, the classical and the quantum. In the classical scenario, basic elements of classical information theory are bits, while the transmission, processing and load of information obey the laws of classical mechanics [1]. In the quantum context, quantum information theory introduces a new paradigm for processing and transmitting information through quantum channels. Unlike classical systems, quantum information theory is governed by the principles of quantum mechanics, leveraging phenomena such as superposition and entanglement. This implies that information can be represented not only in the form of classical bits but also as quantum bits, commonly known as qubits. Whereas bits can only take on two values, qubits can exist in a superposition of both states simultaneously, providing greater flexibility and computational power in quantum information processing (see Fig. 1). However, the approach to concepts of information theory in the quantum setting is carried out with an analogous strategy to the classical setting [2], [3].
(a) The classic bit has two well-defined values, either 0 or 1. (b) The qubit can be a superposition of the states
The study of channel capacity is one of the applications of information theory and has a number of practical implications. In conceptual terms, the capacity of classical channels is a number that indicates the asymptotic rate whose information can be reliably transmitted through the channel. Due to features of quantum mechanics such as entanglement and superposition, it is possible to transmit information over quantum channels in a variety of ways. Naturally, the capacity of these channels can be defined in different ways, each depending on the type of information being transmitted, making it possible to transmit classical information or quantum states [4], [5]. Regarding the transmission of classical information, there are two examples of quantum channel capacity definitions. The quantum channel capacity was defined by Holevo-Schumacher-Westmoreland (HSW), as the maximum asymptotic rate in which classical information can be reliably transmitted using quantum encoding and decoding [6], [7]. Another example of definition for quantum channel capacity is the classical entanglement capacity
The capacities mentioned previously are defined considering that the probability decreases exponentially as the code size increases. Shannon defined the classical zero-error capacity of channels as the maximum rate achieved when the probability of error is equal to zero [9]. Since its definition, zero-error capacity has been studied in various scenarios [10]. The zero-error capacity of quantum channels was defined by Medeiros and Assis [11] as the supreme of rates in which classical information can be transmitted by a quantum channel with an error probability exactly equal to zero. Thus, using quantum block coding, classical information is mapped into quantum states and these states are transmitted through a noisy quantum channel. As part of the process, quantum states are measured on reception and, in this case, the transmission needs to be carried out with exactly zero-error rate.
An important aspect in the study of zero-error capacity of quantum channels is to verify whether a quantum channel has positive zero-error capacity or not. It is well known that determining the HSW capacity of a quantum channel is NP-Complete, while determining the zero-error capacity of a quantum channel is QMA-Complete [12]. However, even determining whether a channel’s capacity is positive can be a challenging problem [13]. Three conditions are known in the literature to verify the zero-error capacity condition of quantum channels, which are found in Medeiros and Assis [11], [14] and Gupta et al. [15]. In addition to these zero-error capacity conditions, Oliveira et al. [16], proves another condition to verify the zero-error capacity of quantum channels, which takes into account whether Kraus operators representing the quantum channel have at least two common eigenstates.
The relation between eigenstates common to the Kraus operators and the zero-error capacity of quantum channels is useful to relate quantum zero-error information theory and Shemesh theorem. The Shemesh theorem establishes a criterion that guarantees the existence of a common subspace for two square matrices. In other words, the theorem states that two square matrices \begin{equation*}\mathcal {M}= \cap _{r,s=1}^{d-1}\text {ker}\ [A_{1}^{r}, A_{2}^{s}]\end{equation*}
The main objective of this paper is to establish results concerning the relationship between Shemesh theorem and zero-error quantum information theory. The motivation behind this study stems from Shemesh theorem, which provides a criterion for linear operators to share a common eigenstate. Notably, every eigenstate common to the Kraus operators of a quantum channel is also a fixed point of the channel.
This paper is organized as follows: Section II presents the definition of a quantum channel and some basic properties. In addition, it discusses the zero-error capacity of a quantum channel, which is fundamental to understanding the main objective of this work. Section III presents the statement of Shemesh theorem along with its corresponding generalization. Section IV demonstrates results related to the connection between zero-error quantum information theory and Shemesh theorem. Finally, Section V presents the conclusions.
A. Related Work
Shemesh theorem presents a criterion for two square matrices to have a common eigenstate [17]. After the publication of the original idea by Dan Shemesh [17], other researchers proposed studies related to the Shemesh criterion. The criterion was generalized to a finite number of square matrices in [18].
One of the resulting problems related to the result proposed by Shemesh is the investigation of invariant common subspace for matrices. When it comes to investigating conditions for two square matrices
Jamiołkowski and Pastuszak [18] presented a criterion guaranteeing the existence of a common invariant subspace for a finite number of matrices in
The concept of a common invariant subspace plays a significant role in Quantum Information Theory. Jamiołkowski [26] used the Shemesh theorem and the generalized Shemesh theorem to analyze the properties of quantum control systems. Farenick [27] related the concept of invariant common subspace to irreducible quantum channels, showing that a quantum channel represented by Kraus operators is irreducible if, and only if, the Kraus operators have no common invariant subspace. Another relevant question is the relationship between common invariant subspace and the concept of decoherence-free subspace (DFS) [28], [29]. To understand this relationship, we should consider that quantum channels are used to transmit information, but the information can be corrupted by decoherence [30]. To reduce the effect of decoherence, one can hide the information in a decoherence-free subspace (DFS). Using the generalized Shemesh theorem, one can decide when the algebra generated by the Kraus operators of the quantum channel has a DFS with dimension
It is also important to highlight the work of Białonczyk et al. [31], which uses the Shemesh theorem to analyze the spectral properties of the Kraus operators associated with a quantum channel, as well as the connection that these spectral properties have with the algebra generated by the Kraus operators of the quantum channel. The spectral properties of the Kraus operators of a quantum channel are essential for determining the asymptotic dynamics of quantum systems subject to multiple interactions described by the same quantum channel.
In the context of Quantum Zero-Error Information Theory, we found no existing literature that explores a connection between Shemesh theorem and this field. More specifically, our research indicates that there are no published works linking Shemesh theorem to the concept of zero-error capacity in quantum channels, highlighting the novelty of this investigation.
B. Notations and Definitions
In this paper, we always refer to Hilbert spaces of finite dimension d and denote them by
Zero-Error Capacity of Quantum Channels
In this section we discuss the main definitions and known results on zero-error capacity of quantum channels is presented, which are important to facilitate the reading and understanding of this paper [11], [32].
A quantum channel
Let \begin{equation*} \mathcal {E}(\rho)=\sum _{i=1}^{\kappa }A_{i}\rho A_{i}^{\dagger }. \tag {1}\end{equation*}
When considering a quantum channel \begin{equation*} p(j|i)=\text {tr}[\sigma _{i}M_{j}]=\text {tr} [\mathcal {E}(\rho _{i})M_{j}]. \tag {2}\end{equation*}
A quantum
a set of indices
, where each index is associated with a classical message;\{1,\ldots,m\} a coding function
leading to codewords\begin{equation*} f_{n}:\{1,\ldots,m\}\longrightarrow \mathcal {S}^{\otimes n} \tag {3}\end{equation*} View Source\begin{equation*} f_{n}:\{1,\ldots,m\}\longrightarrow \mathcal {S}^{\otimes n} \tag {3}\end{equation*}
, where eachf_{n}(1)=\overline {\rho _{1}},\ldots, f_{n}(m)=\overline {\rho _{m}} ;\overline {\rho _{i}} \in \mathcal {S}^{\otimes n} A decoding function
which deterministically associates a message with one of the possible measurement\begin{equation*} g:\{1,\ldots,k\}\longrightarrow \{1,\ldots,m\} \tag {4}\end{equation*} View Source\begin{equation*} g:\{1,\ldots,k\}\longrightarrow \{1,\ldots,m\} \tag {4}\end{equation*}
performed by POVMy \in \{1,\ldots,k\} . Furthermore, the decoding function has the following property:\{M_{i}\}_{i=1}^{k} for all\begin{equation*} \text { Pr}[g(\mathcal {E}(f_{n}(i)))\neq i]=0 \tag {5}\end{equation*} View Source\begin{equation*} \text { Pr}[g(\mathcal {E}(f_{n}(i)))\neq i]=0 \tag {5}\end{equation*}
.i \in \{1,\ldots,m\}

Definition 1 Zero-Error Capacity of Quantum Channels[11]):
The quantum zero-error capacity of a quantum channel \begin{equation*} C^{(0)}(\mathcal {E})=\text {sup}_{\mathcal {S}}\text {sup}_{n}\ \frac {1}{n} \text { log}\ m \tag {7}\end{equation*}
The decoding error is related to the ability to distinguish between two states. Two quantum states are distinguishable if, and only if, the Hilbert spaces generated by the supports of these quantum states are orthogonal. Thus, two quantum states
Consequently, certain tests for determining whether a channel has positive zero-error capacity (e.g., [11, Proposition 1] and [14, Proposition 1]) assert that a quantum channel \begin{equation*} S:=\text {span}\{A_{i}^{\dagger } A_{j}\} \tag {8}\end{equation*}
There is a relation between zero-error capacity and fixed point of the quantum channel
Proposition 2[11]:
Let
The proof of this result can be found in [11, Proposition 2].
Next, it is proved every eigenstate common to all Kraus operators representing the quantum channel is a fixed point of this channel. Based on this result, one can conclude that zero-error capacity of quantum channel is positive if all the Kraus operators have at least two common eigenstates.
Lemma 3[16]:
Let a quantum channel be
Proof:
We just need to show that \begin{align*} \mathcal{E}(|\psi\rangle\langle\psi|) & \stackrel{(\mathrm{a})}{=} \sum_{i=1}^\kappa A_i|\psi\rangle\langle\psi| A_i^{\dagger} \stackrel{(\mathrm{b})}{=} \sum_{i=1}^\kappa \lambda_i|\psi\rangle \lambda_i^*\langle\psi| \tag{9}\\ & \stackrel{(\mathrm{c})}{=}|\psi\rangle\langle\psi| \sum_{i=1}^\kappa\left|\lambda_i\right|^2 \stackrel{(\mathrm{~d})}{=}|\psi\rangle\langle\psi|, \tag{10}\end{align*}
Define
Theorem 4[16]:
Let
Proof:
The Lemma 3 showed that states in
The condition for positive zero-error capacity of a quantum channel established in Theorem 4 is highly practical. Given the d possible distinct eigenstates in the Hilbert space, it is sufficient to verify that at least two of these eigenstates are common to the Kraus operators representing the channel.
The Shemesh Theorem
In order to show the relation between Shemesh theorem and quantum zero-error information theory, in this section we will present Shemesh theorem in the non-generalized version with proof, and state the generalized theorem.
Theorem 5 (Shemesh Theorem[17]):
Let \begin{equation*} \mathcal {M}= \cap _{r,s=1}^{d-1}\text {ker}\ [A_{1}^{r}, A_{2}^{s}] \tag {11}\end{equation*}
The demonstration of Shemesh theorem can be found in [17].
Observation 6:
The subspace of Shemesh theorem is contained in \begin{equation*} \mathcal {M}= \cap _{r,s=1}^{d-1}\text {ker}[A_{1}^{r}, A_{2}^{s}] \subset \mathbb {C}^{d}. \tag {12}\end{equation*}
The Theorem 5 can be generalized to a quantity s of matrices and give conditions for them to have common eigenstates. This generalization occurs due to the use of the standard polynomial concept and the Amitsur-Levitzki theorem. The generalized Shemesh Theorem is stated below and it can be found in [18, Theorem 2.4].
Theorem 7 (Generalized Shemesh Theorem[18]):
The matrices \begin{equation*} \mathcal {M}= \cap _{\alpha _{i},l_{i}=1}^{d-1}\text {ker}\ [A_{1}^{\alpha _{1}}\ldots \ A_{\kappa }^{\alpha _{\kappa }}, A_{1}^{l_{1}}\ldots \ A_{\kappa }^{l_{\kappa }}] \tag {13}\end{equation*}
Note that the sufficient and necessary condition for matrices
The Zero-Error Capacity of Quantum Channels and Shemesh Theorem
In this section, results are presented so that we can construct the relation between Shemesh theorem and quantum zero-error information theory.
Lemma 8:
Let
Proof:
By hypothesis \begin{equation*} A^{2}\left \vert {{\varphi }}\right \rangle = A(A \left \vert {{\varphi }}\right \rangle)= A(\lambda \left \vert {{\varphi }}\right \rangle)=\lambda A\left \vert {{\varphi }}\right \rangle =\lambda ^{2}\left \vert {{\varphi }}\right \rangle. \tag {14}\end{equation*}
\begin{equation*} A^{k}\left \vert {{\varphi }}\right \rangle =\lambda ^{k}\left \vert {{\varphi }}\right \rangle. \tag {15}\end{equation*}
\begin{align*} A^{k+1}\left \vert {{\varphi }}\right \rangle =A(A^{k}\left \vert {{\varphi }}\right \rangle)=A(\lambda ^{k}\left \vert {{\varphi }}\right \rangle)=\lambda ^{k}\lambda \left \vert {{\varphi }}\right \rangle =\lambda ^{k+1}\left \vert {{\varphi }}\right \rangle. \tag {16}\end{align*}
Proposition 9:
Let be a quantum channel \begin{equation*} \mathcal {M}= \cap _{\alpha _{i},l_{i}=1}^{d-1}\text {ker}\ [A_{1}^{\alpha _{1}}\ldots A_{\kappa }^{\alpha _{\kappa }}, A_{1}^{l_{1}}\ldots A_{\kappa }^{l_{\kappa }}] \tag {17}\end{equation*}
Proof:
Suppose that \begin{align*} A_{1} \left \vert {{\varphi }}\right \rangle = \lambda _{1} \left \vert {{\varphi }}\right \rangle,\ A_{2} \left \vert {{\varphi }}\right \rangle = \lambda _{2} \left \vert {{\varphi }}\right \rangle,\ldots,A_{\kappa }\left \vert {{\varphi }}\right \rangle = \lambda _{\kappa }\left \vert {{\varphi }}\right \rangle. \tag {18}\end{align*}
\begin{align*}& [A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }},A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }}] \\ & = (A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})-(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }}). \tag {19}\end{align*}
Note that\begin{align*}& ((A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})-(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }}))\left \vert {{\varphi }}\right \rangle \tag {20}\\ & =(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa -1}^{b_{\kappa -1}})A_{\kappa }^{b_{\kappa }}\left \vert {{\varphi }}\right \rangle \\ & \quad -(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa -1}^{a_{\kappa -1}})A_{\kappa }^{a_{\kappa }}\left \vert {{\varphi }}\right \rangle \tag {21}\\ & =(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa -1}^{b_{\kappa -1}})\lambda _{\kappa }^{b_{\kappa }}\left \vert {{\varphi }}\right \rangle \\ & \quad -(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa -1}^{a_{\kappa -1}})\lambda _{\kappa }^{a_{\kappa }}\left \vert {{\varphi }}\right \rangle \tag {22}\\ & =\lambda _{\kappa }^{b_{\kappa }}(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa -1}^{b_{\kappa -1}})\left \vert {{\varphi }}\right \rangle \\ & \quad -\lambda _{\kappa }^{a_{\kappa }}(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa -1}^{a_{\kappa -1}})\left \vert {{\varphi }}\right \rangle \tag {23}\\ & =\lambda _{\kappa }^{b_{\kappa }}(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa -2}^{b_{\kappa -2}})A_{\kappa -1}^{b_{\kappa -1}}\left \vert {{\varphi }}\right \rangle \\ & \quad -\lambda _{\kappa }^{a_{\kappa }}(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa -2}^{a_{\kappa -2}})A_{\kappa -1}^{a_{\kappa -1}}\left \vert {{\varphi }}\right \rangle \tag {24}\\ & =\lambda _{\kappa }^{b_{\kappa }}\lambda _{\kappa -1}^{b_{\kappa -1}}(A_{1}^{a_{1}}\ldots A_{\kappa }^{a_{\kappa }})(A_{1}^{b_{1}}\ldots A_{\kappa -2}^{b_{\kappa -2}})\left \vert {{\varphi }}\right \rangle \\ & \quad -\lambda _{\kappa }^{a_{\kappa }}\lambda _{\kappa -1}^{a_{\kappa -1}}(A_{1}^{b_{1}}\ldots A_{\kappa }^{b_{\kappa }})(A_{1}^{a_{1}}\ldots A_{\kappa -2}^{a_{\kappa -2}})\left \vert {{\varphi }}\right \rangle \\ & \quad \vdots \hspace {2cm} \vdots \hspace {2cm} \vdots \hspace {2cm} \vdots \tag {25}\\ & =\lambda _{\kappa }^{b_{\kappa }}\lambda _{\kappa -1}^{b_{\kappa -1}}\ldots \lambda _{1}^{b_{1}} \lambda _{\kappa }^{a_{\kappa }}\lambda _{\kappa -1}^{a_{\kappa -1}}\ldots \lambda _{1}^{a_{1}}\left \vert {{\varphi }}\right \rangle \\ & \quad -\lambda _{\kappa }^{a_{\kappa }}\lambda _{\kappa -1}^{a_{\kappa -1}}\ldots \lambda _{1}^{a_{1}}\lambda _{\kappa }^{b_{\kappa }}\lambda _{\kappa -1}^{b_{\kappa -1}}\ldots \lambda _{1}^{b_{1}}\left \vert {{\varphi }}\right \rangle \tag {26}\\ & =(\lambda _{\kappa }^{b_{\kappa }+a_{s}}\lambda _{\kappa -1}^{b_{\kappa -1}+a_{\kappa -1}}\ldots \lambda _{1}^{b_{1}+a_{1}} \\ & \quad -\lambda _{\kappa }^{a_{\kappa }+b_{\kappa }}\lambda _{\kappa -1}^{a_{\kappa -1}+b_{\kappa -1}}\ldots \lambda _{1}^{a_{1}+b_{1}})\left \vert {{\varphi }}\right \rangle \tag {27}\\ & = 0 \cdot \left \vert {{\varphi }}\right \rangle =0. \hspace {5.74cm} \tag {28}\end{align*}
\begin{equation*} \mathcal {M}= \cap _{\alpha _{i},l_{i}=1}^{d-1}\text {ker}\ [A_{1}^{\alpha _{1}}\ldots A_{\kappa }^{\alpha _{\kappa }}, A_{1}^{l_{1}}\ldots A_{\kappa }^{l_{\kappa }}]. \tag {29}\end{equation*}
The Proposition 9 shows that if \begin{equation*}{{\mathcal {M}}_{\mathcal {E}}=\{ \left \vert {{\varphi }}\right \rangle \left \langle {{\varphi }}\right \vert \in \mathcal {M}: \mathcal {E}(\left \vert {{\varphi }}\right \rangle \left \langle {{\varphi }}\right \vert)=\left \vert {{\varphi }}\right \rangle \left \langle {{\varphi }}\right \vert \}},\end{equation*}
Corollary 10:
Proof:
Let
Proposition 11:
If
Proof:
By hypothesis the states belongs to
The result of Proposition 11 establishes a connection between Shemesh theorem and the zero-error capacity of the quantum channel.
Example 12:
Let \begin{align*}A_{1}& = \sqrt {p}\left ({{ \begin{array}{ccccc} 1 & \quad 0 & \quad 0 & \quad 0 \\[7pt] 0 & \quad \dfrac {1}{2} & \quad \dfrac {\sqrt {3}}{2} & \quad 0 \\[7pt] 0 & \quad \dfrac {\sqrt {3}}{2} & \quad -\dfrac {1}{2} & \quad 0 \\[7pt] 0 & \quad 0 & \quad 0 & \quad 1 \\ \end{array} }}\right), \\ A_{2}& = \sqrt {1-p}\left ({{ \begin{array}{ccccc} 1 & \quad 0 & \quad 0 & \quad 0 \\[7pt] 0 & \quad \dfrac {1}{2} & \quad - \dfrac {\sqrt {3}}{2} & \quad 0 \\[7pt] 0 & \quad -\dfrac {\sqrt {3}}{2} & \quad -\dfrac {1}{2} & \quad 0 \\[7pt] 0 & \quad 0 & \quad 0 & \quad 1 \\ \end{array} }}\right).\end{align*}
\begin{align*}A_{1}= \left ({{ \begin{array}{ccccc} \dfrac {1}{2} & \quad 0 & \quad 0 & \quad 0 \\[7pt] 0 & \quad \dfrac {1}{4} & \quad \dfrac {\sqrt {3}}{4} & \quad 0 \\[7pt] 0 & \quad \dfrac {\sqrt {3}}{4} & \quad -\dfrac {1}{4} & \quad 0 \\[7pt] 0 & \quad 0 & \quad 0 & \quad \dfrac {1}{2} \\ \end{array} }}\right), \\ A_{2}= \left ({{ \begin{array}{ccccc} \dfrac {\sqrt {3}}{2} & \quad 0 & \quad 0 & \quad 0 \\[7pt] 0 & \quad \dfrac {\sqrt {3}}{4} & \quad - \dfrac {3}{4} & \quad 0 \\[7pt] 0 & \quad -\dfrac {3}{4} & \quad -\dfrac {\sqrt {3}}{4} & \quad 0 \\[7pt] 0 & \quad 0 & \quad 0 & \quad \dfrac {\sqrt {3}}{2} \\ \end{array} }}\right).\end{align*}
Conclusion
In this paper, we established a relationship between Shemesh theorem and zero-error quantum information theory. We first defined a quantum channel using Kraus operators on a finite-dimensional Hilbert space and introduced the concept of zero-error capacity. We particularly emphasized a test for zero-error capacity based on the eigenstates common to the Kraus operators. We then analyzed Shemesh theorem and demonstrated its connection to zero-error quantum information theory by proving that every eigenstate common to the Kraus operators, which serves as a fixed point of the channel, belongs to the Shemesh subspace. It is also important to recognize that there may be additional fixed points within the Shemesh subspace beyond those eigenstates common to the Kraus operators. Given that a lower bound for the zero-error capacity of a quantum channel can be associated with the number of its fixed points, we highlight the connection between Shemesh theorem and zero-error quantum information theory, particularly in the context of zero-error capacity.