ArticlesAll Issue
ArticlesEnhancement of Speech Encryption/Decryption Process Using RSA Algorithm Variants
• Eman Abouelkheir1,2,* and Shamia El-Sherbiny1

Human-centric Computing and Information Sciences volume 12, Article number: 06 (2022)
https://doi.org/10.22967/HCIS.2022.12.006

Abstract

Recently, speech encryption attracts many researchers because of the various applications of speech communications such as; e-learning, e-banking, military, teleconferencing and other fields. In this work, a new modification on RSA (Rivest–Shamir–Adleman) algorithm is proposed to enhance the performance of conventional RSA up on application in audio cryptosystems. This paper is concerned with speech encryption and decryption based on the well-known RSA algorithm and some of its variants, including our own suggestion. The performance of both the original RSA algorithm and its variants is investigated and tested through estimating some parameters that give the indication of audio cryptography quality. The parameters that are estimated in the experimental test are; mean square error between the original signal and the decrypted signal, linear predictive code measure (LPC), cepstral distance measure (CD), the segmental signal-to-noise ratio (SSNR) and the execution time. Based on the estimated parameters, a performance comparison between the investigated algorithms is introduced. The obtained results show that the RSA algorithm and its variants are efficient to secure the audio communications and our new proposed modification reduces the processing time approximately by 39%–53%, compared to the original RSA algorithm and hence it is efficient in real time applications.

Keywords

Speech Encryption/Decryption, RAS Algorithm Variants, Linear Predictive Code Measure (LPC), Cepstral Distance Measure (CD), Segmental Signal-to-Noise Ratio (SSNR)

Instruction

Data reliability, secrecy, accessibility and confidentially are the main issues in communication process security. Cryptography is widely used in communication systems to secure and protect data. Cryptography plays an effective rule in various applications such as e-mail, e-commerce, sending financial information, pay-TV, and so forth. Up on application of cryptography, the meaning of the message is hided and the plain text is converted cipher text through encryption phase and the reverse process is carried out through decryption phase and hence, the insecure physical channel can be regarded as a secure logical channel [1]. Cryptography is classified into two main types known as symmetric cryptography and asymmetric cryptography. In symmetric cryptography a single key for encryption and decryption is used, whereas the asymmetric cryptography uses two keys: public key for encryption and private key for decryption. The asymmetric techniques are more secure than symmetric techniques but they take longer processing time. RSA (Rivest–Shamir–Adleman) is one of the most widely used asymmetric techniques which find applications in many fields such as email encryption, SSL/TLS certificates, cryptocurrencies and many other applications [24]. The reasons of the popularity of RSA are its reliability and ease of implementation. A number of modifications on RSA are recently conducted to enhance its performance [57].
Recently, audio messages, as an essential form of data, can be exchanged over different communication channels and hence, there is a dire need for audio encryption. In the case of audio encryption, the audio message is transformed to an ambiguous form. There are several research papers which are concerned with the audio encryption on the basis of some well-known encryption algorithms such as scrambling, elliptic curve cryptography (ECC), chaotic encryption and RSA algorithms [3, 812]. Various encryption techniques which comprise more than one algorithm have been proposed and applied to audio messages in recent years. Researches in [13], used the hyperchaotic system and the modified Henon map to encrypt the speech signal which is compressed by fast Walsh Hadamard transform. Other group in [14] encrypted the audio data using two stages: the first is block ciphering based on DNA encoding and logistic map and the second stage is based on channel shuffling to enhance the security. Authors in [15], used DNA coding and chaotic systems to encrypt audio messages, the new in this algorithm is the usage of hash value of the message to control the initiation of the chaotic system. Researchers in [16], combined the discrete wavelet transform (DWT) with the measured biometric features extracted from human hand geometry to carry out the speech encryption. In [17], both ECC and 3DES algorithms are used together to achieve audio encryption during transmission through mobile network. A new audio encryption technique based on combining chaotic systems and fast Fourier transform (FFT) has been introduced in [18], this technique uses two chaotic systems: one is the logistic map and the other is 3D Lorenz chaotic system to encrypt the speech message which is initially scrambled using FFT. Authors in [19], combined discrete cosine transform (DCT) with the scrambling algorithm to construct the speech encryption module as a part of speech retrieval process in cloud environment. In [20], the security of audio transmission is enhanced through combining four different encryption techniques: cipher feedback encryption, dynamic DNA coding, chaotic maps, and self-adaptive scrambling encryption. Most of the above stated work focused on the degree of security of the proposed algorithms, but there is another side that should be taken into account especially in real-time applications. This is the encryption and decryption times and from this point of view we think that simple algorithms will be preferred for audio encryption. Audio encryption based on RSA algorithm have been presented by some researchers using various implementation techniques.
The aim of this work is the enhancement of audio encryption through investigating the application of the RSA algorithm and its variants. For this purpose, we surveyed the RSA algorithm and its recent modifications, a new modification on RSA is introduced as one contribution of this work and then audio cryptosystems based on RSA and its variants, including our own developed one, are implemented. The performance of audio encryption is investigated by determining some audio quality metrics such as linear predictive code measure (LPC), mean square error (MSE) between the original signal and the decrypted signal, cepstral distance measure (CD) and the segmental signal-to-noise ratio (SSNR) [21]. Also, in this work, we are concerned with evaluating the encryption and decryption times as they are effective in the case of real time applications.
The rest of this paper is organized as follows. Section 2 presents the related work based on asymmetric cryptography. Section 3 illustrates RSA algorithm and its variants including our proposed modification. The audio transmission with a cryptosystem is presented in Section 4. Section 5 presents the audio quality metrics that are used to investigate the performance of the algorithms. Results and discussion are presented in Section 6, a comparison with some current systems is introduced in Section 7, and finally, Section 8 gives the main conclusions.

Related Work

In the introduction section a general overview of speech encryption based on various techniques was introduced. This section focuses on the efforts based on classical techniques. There are a considerable number of propositions that were made on the application of asymmetric key cryptography in multimedia transmission. Authors in [22] modified El-Gamal cryptosystem to be applied over gray and color images, both encryption and decryption scenarios worked well. A combination of El-Gamal and scan methods was introduced in [23] to encrypt image. In [24], El-Gamal algorithm was utilized to enhance the security of speech transmission over open and shared networks. Authors in [25] protected the transmission of speech by applying cryptosystem which is based on Diffie-Hellman algorithm. Due to the simplicity of RSA, ease of implementation, low computational complexity and difficulty of breaking, various efforts were introduced to develop voice cryptosystems which are based on RSA algorithm.
The technique presented by researchers in [26], is based on saving different speech words from different speakers in a wave file, extracting data from the wave file and saving it in a text file as integer data and then performing the encryption and decryption processes on the integer data. In [27], a new encryption technique based on symmetric cryptography was suggested and applied for audio encryption, the results of the suggested technique were compared with the obtained results in the case of audio encryption based on RSA, and the suggested method produced a decrypted signal with higher quality. The performance of audio encryption based on RSA in terms of audio quality metrics was investigated in [28], the results obtained in that work ensures the validity of RSA in secure audio transmission as well as high quality of the recovered message. In addition to using RSA for audio encryption, some researchers investigated its application for video encryption. Most of researchers suggested multi-layer techniques for video encryption to increase the security. Video encryption based on RSA and ECC was presented in [29]. Also, [30] utilized dual layer for video encryption, the first layer is based on RSA whereas the second layer is based on pseudo-noise sequence. For further improvement of video encryption security, authors in [31] utilized a hybridization algorithm that consists of three layers: the first is based on RSA algorithm, the second is based on DES algorithm, and the third is a combination of both of them. In this work, we propose a new modification on RSA, in addition to applying RSA and its variants in speech security enhancement. Experimental investigations are concerned with quality metrics measurement of both encrypted and recovered speech signals as well as examining the improvements in RSA speed up on applying different variants in speech encryptions. To evaluate the effectiveness of our proposed technique it will be compared to both classical techniques and the most recently developed techniques which are based on chaotic cryptography [18, 20, 3234]. In [18], the authors present a new speech encryption technique in which a 3D Lorenz-logistic map is introduced and used to generate three random number sequences which are used to permute the initial speech signal and the real and imaginary parts of its FFT. The author in [20] used three encryption techniques (DNA, self-adaptive scrambling, and cypher feedback encryption) in addition to chaotic maps to secure the audio transmission. Authors in [32] proposed synchronized chaotic systems at both transmitter and receiver to achieve speech encryption in case of multi-user communication. In [33], DWT is combined with the chaotic map in audio encryption to enhance the storage and transfer efficiencies. Authors in [34] encrypted the speech signal using both of cryptography protocols and chaotic maps, in this work different types of one-dimensional maps are used, the protection of parameters are carried out using blowfish algorithm, hashing algorithm is used to authenticate the blowfish key and the shared data which in turn increased the security of the system.

RSA Algorithm Modifications

This section introduces an overview on RSA and some of its modified versions.

The Original RSA Algorithm and Some of its Recent Variants
The original RSA algorithm was proposed in 1977 by Ron Rivest, Adi Shamir, and Leonard Adleman [2], and it consists of three main steps (key generation, encryption phase, and decryption phase) which can be summarized by the flowchart illustrated in Fig. 1. Various modifications have been carried out to enhance the performance of RSA, one of those modifications is based on generating different values of public key [5]. This modification resulted in an enhancement in the security with a slight increase in encryption time and no change in decryption time. Fig. 2 presented the flow involved in the modification based on public key to enhance the security of RSA keys. Another modification on RSA which is illustrated in Fig. 3 provides higher security level and higher speed than the original RSA by using four numbers to overcome the factorization problem of the integer n [6]. The research [7] enhanced the performance of RSA through using dynamic keys for data encryption. In that work, five primes were used and hence the decomposition of the modulus n by intruders became more difficult, the details of this algorithm is illustrated in Fig. 4.

 Fig. 1. Original RSA algorithm. Fig. 2. Modified RSA algorithm based on the public key. Fig. 3. RSA algorithm based four primes. Fig. 4. RSA algorithm based five numbers.

Proposed Approach
Our proposed approach aims to enhance the performance of the original RSA through the combination of the two features introduced in both [6] and [7]. In [6], the author proposed another modification on RSA which provides higher security level and higher speed than the original RSA by using five numbers to overcome the factorization problem of the integer n. The research [7] enhanced the security of RSA through using dynamic keys for data encryption. The proposed algorithm used the dynamic keys [6] with five primes [7], so, for security and time enhancement. This algorithm overcomes the integer factorization problem and increases the speed of the original RSA. The algorithm details are as follows:
Key generation
1 Select large prime numbers p and q
2 Select any three random numbers r, s, t.
3 Compute n=p*q*r*s*t.
4 Compute
5 Collect at maximum 10 values for the public key e, such that , and e is coprime with n and .
6 Select public key e,
7 Calculate
8 Select d under the condition
9 The public key pair is (f, n) and the secrete key pair is (d, n).

Encryption
$C = M^{((f-1)/2)}modn$

Decryption
$M = C^d modn$
The previous stated steps of our proposed modification is summarized in Fig. 5.

Fig. 5. Modified RSA algorithm based on both of public key and five primes.

Audio Transmission with a Cryptosystem

The audio transmission with a cryptosystem shown in Fig. 6, can be summarized as:

Audio data collection from audio signal.

Encryption of the collected data at transmitter using an encryption technique.

Transmission of the encrypted data through the communication channel.

Decryption of the received data at the receiver to recover the original message.

Fig. 6. Audio transmission with cryptosystem.

Audio Quality Metrics with a Cryptosystem

The performance of audio cryptosystem can be evaluated by measuring the quality of the processed speech. There are two categories of speech quality metrics: subjective and objective [21, 31]. The subjective metrics depend on the impression of the listener about the intelligibility of speech, whereas, the objective metrics depend on the original speech and the processed speech, they can be estimated using some mathematical expressions. There are a number of widely used objective metrics such as LPC, also known as log likelihood ratio (LLR), SSNR, CD, MSE between the original signal and the processed signal and correlation between the original signal and the processed signal [21, 35, 36].

LLR
The LLR measures the distance between the vectors of linear prediction coefficients determined for the original and processed speech.

$LLR = log\left(\frac{a_xR_ya_x^T}{a_yR_ya_y^T}\right)$(1)

SSNR
SSNR is the average signal-to-noise ratio over a number of short frames, it can be calculated using

$SSNR=\frac{!0}{M}\displaystyle\sum_{m=0}^{M-1}log_{10}\frac{\displaystyle\sum_{i=Nm}^{Nm+N-1}x^2(i)}{\displaystyle\sum_{i=Nm}^{Nm+N-1}(x(i)-y(i))^2}$(2)

where x(i) is the original speech, y(i) is the processed speech, N represents the frame length and M is the total number of frames 21, 36].

CD
CD is a measure based on cepstrum coefficients and can be calculated using

$CD=10log_{10}\left[2\displaystyle\sum_{n=1}^{p}{C_x(n)-C_y(n)}^2\right]^{1/2}$(3)

where Cx and Cy are the cepstral vectors of the original speech and the processed speech, respectively [21].

Correlation between Original Signal and Processed Signal
The correlation coefficient can be expressed as

$r_{xy}=\frac{C_v(x,y)}{\sqrt{D(x)}\sqrt{D(y)}}$(4)

where Cv(x, y) is the covariance between the original and processed signals. D(x) and D(y) are the variances of x and y, respectively [21].

MSE between Original Signal and Processed Signal
MSE can be expressed as

$MSE=\frac{1}{Ns}\displaystyle\sum_{m=1}^{Ns}(x(m)-y(m))^2$(5)

where Ns is the number of samples.

Experimental Results and Discussion

Our goal in this section is the investigation of the quality of the encrypted speech and the decrypted speech up on the application of RSA algorithm variants to the original speech. For this purpose, we used different types of speech signals: a single word spoken by different speakers “zero” and two different long sentences. The performance of RSA variants as well as the performance of our proposed modification have been tested via MATLAB experimental implementation using lab top with Intel processor core i3, 4 GB RAM, 64-bit operating system.
The simulation steps can be summarized as follows. First, different audio files are obtained through recording different words and sentences, by different speakers, and saving them in WAV format using sampling rate of 8 kHz and sample length of 16 bits. Secondly, the MATLAB code is built to implement the encryption and decryption process. The process starts, for each algorithm, with entering the prime numbers required, then developing the code which represents the mathematical equation describing the three phases: key generation, encryption, and decryption illustrated in Section 3. Determining the quality metrics of both the encrypted and decrypted signals is an indication to the security level and performance of the algorithm.
We started our investigation by estimating the quality metrics, stated in Section 4, between the original speech and decrypted speech, as well as processing time for one audio word, recorded by different persons, as illustrated in Tables 1–3.

Table 1. Quality metrics of decryption phase (between original speech and decrypted speech) for speaker 1

Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.42E-11 46.3226 -21.4082 -1.01E-07 1
New public two primes [5] 3.42E-11 46.3226 -21.4082 -1.01E-07 1
Four primes [6] 3.42E-11 46.3226 -21.4082 -1.01E-07 1
Five primes [7] 3.42E-11 46.3226 -21.4082 -1.01E-07 1
Proposed approach 3.42E-11 46.3226 -21.4082 -1.01E-07 1

Table 2. Quality metrics of decryption phase (between original speech and decrypted speech) for speaker 2
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.41E-04 35.4115 0.1533 -0.3152 0.9955
New public two primes [5] 3.41E-04 35.4115 0.1504 -0.3146 0.9955
Four primes [6] 4.49E-04 35.4115 0.6251 -0.3878 0.9904
Five primes [7] 7.11E-04 35.4115 1.7046 -0.6367 0.9834
Proposed approach 7.07E-04 35.4115 2.0515 -0.7401 0.9834

Table 3. Quality metrics of decryption phase (between original speech and decrypted speech) for speaker 3
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
New public two primes [5] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Four primes [6] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Five primes [7] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Proposed approach 3.41E-11 46.9749 -19.9846 -2.81E-08 1

The original RSA algorithm [2] and the modified RSA [5] based on the public key run using primes 11 and 17. The modified RSA [6] using four primes runs using 5, 7, 11, and 17. The modified RSA [7] using five primes runs using 3, 5, 7, 11, and 17. Also, the proposed approach runs using 3, 5, 7, 11, and 17.
Also, we try different prime numbers and the decrypted signal did not change. So the small prime numbers previously used in our program just to try the speech encryption. The proposed based on five primes algorithm is compatible to use any large or small prime with the same recovery efficiency. Our algorithm based on five prime numbers, so this increase the running time and decrease the factorization problem.
It can be noticed from the results listed above that SSNR, obtained for all speakers upon application of different algorithms, has high positive values whereas MSE, CD, and LLR have low values which means that the decrypted audio signal has a high quality and a good precision [3]. Also the results show very high correlation between the original signals and the recovered signals.
To study the performance of RSA variants at the encryption phase, the SSNR of the encrypted signals as well as the MSE between the original signals and their encrypted versions are listed in Table 4.
It is obvious from Table 4 that SSNR in all cases has high negative values, more negative values of SSNR refer to more powerful scheme, and MSE has high positive values which indicates the extreme degradation of the encrypted signals from the original transmitted signals. In general, the obtained results of both the encryption phase and decryption phase ensure the effectiveness of RSA algorithm and its variants in audio encryption. The results in case of our proposed modification ensures that it is more secure than original RSA and other variants. To investigate the competition between various RSA variants, the processing times in different cases are listed in Tables 5–7. The results shown in Tables 5–7 indicates an improvement in processing time upon application of RSA variants. The processing time of our proposed modification is the smallest compared to the others and hence it competes in real-time applications.

Table 4. SSNR and MSE of the encrypted signals
Algorithm Speaker 1 Speaker 2 Speaker 3
SSNR MSE SSNR MSE SSNR MSE
Original RSA [2] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
New public two primes [5] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Four primes [6] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Five primes [7] 3.41E-11 46.9749 -19.9846 -2.81E-08 1
Proposed approach 3.41E-11 46.9749 -19.9846 -2.81E-08 1

Table 5. Processing time for different speakers (speaker 1)
Algorithm Encryption time (s) Decryption time (s) Total processing time (s) Saving time of the proposed approach
Original RSA [2] 0.6012 0.7215 1.3227 53%
New public two primes [5] 0.7557 0.9447 1.7004 63%
Four primes [6] 0.4528 0.4528 0.9056 31%
Five primes [7] 0.3671 0.3388 0.7059 12%
Proposed approach 0.2972 0.3242 0.6214 -

Table 6. Processing time for different speakers (speaker 2)
Algorithm Encryption time (s) Decryption time (s) Total processing time (s) Saving time of the proposed approach
Original RSA [2] 0.5636 0.6576 1.2212 48%
New public two primes [5] 0.5709 0.6661 1.237 48%
Four primes [6] 0.4168 0.3789 0.7957 20%
Five primes [7] 0.398 0.332 0.729 12%
Proposed approach 0.3074 0.333 0.6404 -

Table 7. Processing time for different speakers (speaker 3)
Algorithm Encryption time (s) Decryption time (s) Total processing time (s) Saving time of the proposed approach
Original RSA [2] 0.5573 0.7431 1.3003 50%
New public two primes [5] 0.8266 1.0333 1.8596 62%
Four primes [6] 0.3889 0.4278 0.8159 14%
Five primes [7] 0.4362 0.3635 0.7997 7%
Proposed approach 0.3395 0.3638 0.7033 -

To visualize the difference between the encrypted signals and their corresponding original signals, time domain plots of both original signal and decrypted signal, MSE plots, cross correlation plots as well as histogram analysis can be used as shown in Figs. 7 and 8. In the histogram, the information is represented as numbers, the closeness of these numbers indicates the good performance of the encryption process [10]. Fig. 8 shows the histogram analysis of the processed word, spoken by different three speakers. The encryption algorithm used in this case is based on our proposed modification. As shown in Fig. 8, the encrypted signals have sample values with equal probability, and hence the attacker cannot read out the useful message. The results shown in Figs. 7 and 8 indicate the high security level and good encryption quality of the proposed modification.

Fig. 7. The original signal, decrypted signal, MSE, and correlation between the encrypted and decrypted signal for the speaker 1, word zero using the proposed approach.

Fig. 8. Histogram analysis of original, encrypted and decrypted speech of different speakers using the new proposed approach: (a) speaker1, (b) speaker 2, and (c) speaker 3.

Table 8. Quality metrics between original speech and decrypted speech for sentence 1
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Table 9. Processing time for sentence 1
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Table 10. Quality metrics between original speech and decrypted speech for sentence 2
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Table 11. Processing time for sentence 2
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Fig. 9. The original signal, decrypted signal, MSE, and correlation between the encrypted and decrypted signal for the sentence 1, word zero using the proposed approach.

Fig. 10. The original signal, decrypted signal, MSE, and correlation between the encrypted and decrypted signal for the sentence 2, word zero using the proposed approach.

In addition to the investigation of performance which based on single word, more investigations of the RSA variants in audio cryptosystem, are carried out using long sentences. The quality metrics as well as the processing time are determined and listed in Tables 8–11. The results presented in Tables 8–11, plots in Figs. 9 and 10 as well as the histogram analysis presented in Fig. 11 ensure the efficiency of all RSA variants in audio encryption and indicate that all RSA variants produces the same quality. Using RSA variants has additional advantages in terms of cryptosystem security against attacks and processing time, our new proposed modification outperforms the others in terms of processing time.

Fig. 11. Histogram analysis of original via the proposed approach, encrypted and decrypted speech: (a) sentence 1 and (b) sentence 2.

Comparison of the Proposed System with Some Current Work

In this section a comparison between the proposed technique and some current work is made to evaluate the system performance in terms of MSE, SSNR and rxy. To do this comparison, different audio signals with different lengths are applied to the proposed cryptosystem, the quality metrics of the encrypted signals are listed in Table 12. Fig. 12 presents the histogram of one of the used sentences, sentence 1. The comparison between the results of the proposed approach and those of other work found in literature, is displayed in Table 13.

Table 12. Quality metrics of the processed signals via the proposed technique

Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Fig. 12. Histogram analysis of original, encrypted and decrypted speech via the proposed approach.

The SSNRs in Table 13 determined for the decrypted signals in [37], [38], and [39] are 121.2, 45.1, and 33.827 dB, respectively whereas in case of the proposed technique SSNR of the decrypted signal reached 79.785 dB.
Also, our scheme takes approximately from 0.028 to 0.033 seconds to encrypt 1 kB of data and increase with increasing file size. So, it is faster than the scheme in [40] as it takes from 0.23 to 0.51 to encrypt 1 kB of data. Also, the proposed scheme is faster than the scheme in [20], as it takes from 0.19 to 0.37 to encrypt 1 kB of data. The obtained results and the above comparison indicates the good performance of the suggested scheme.

Table 13. Comparison between the results of the proposed approach and others
Algorithm Quality metric
MSE SSNR CD LLR rxy
Original RSA [2] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
New public two primes [5] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Four primes [6] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Five primes [7] 3.00E-11 35.4115 -6.4819 -4.93E-04 1
Proposed approach 3.00E-11 35.4115 -6.4819 -4.93E-04 1

Conclusion

The main aim of audio encryption is to protect audio systems from illegal access, disruption or modification. This work investigates the performance of audio cryptosystems based on RSA algorithm and its variants. The work started by reviewing the modifications introduced to enhance the security of the original RSA algorithm and a new modification is also suggested as one of the contributions of this work. The performance is investigated by measuring some audio quality metrics such as SSNR, LLR, MSE, and CD for the encrypted and decrypted signals. The obtained results ensure the effectiveness of RSA and its variants in audio cryptosystems. On brief, the application of RSA variants enhances the security and reduces the processing time and hence they are efficient in real time applications and our proposed modification strongly competes in this area. In future, our proposed cryptosystem will be enhanced through combining this technique with other encryption techniques such as DNA and chaotic systems. Also, compression techniques can be used before encryption to reduce the running time to make our cryptosystem more suitable for real-time applications.

Author’s Contributions

Conceptualization, SES, EA. Funding acquisition, EA. Investigation and methodology, SES, EA. Project administration SES, EA. Resources SES, EA. Supervision SES. Writing of the original draft SES, EA. Writing of the review and editing SES, EA. Software SES, EA. Validation SES. Formal analysis SES, EA. Data curation SES, EA. Visualization SES, EA. All the authors have proofread the final version.

Funding

This research funded by Qassim University.

Acknowledgement

The researchers would like to thank the Deanship of Scientific Research, Qassim University for funding the publication of this project

Competing Interests

The authors declare that they have no competing interests.

Author Biography

Name : Eman Abouelkheir
Affiliation : Department of Computer Science, College of Science and Arts, Qassim University, Alrass, 51452, Saudi Arabia
Department of Electrical Engineering, College of Engineering Kafrelsheikh, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt
Biography : Was born in Saudi Arabia in 1986. She receives B.Sc. from Faculty of Engineering Kafrelsheikh University in 2008. She received M.Sc and Ph.D. from Faculty of Engineering, Alexandria University. She is currently Assistant Professor in Department of computer Science Faculty of Sciences and Arts, Qassim Univerity. She is also as Lecturer Department of Electrical Engineering Faculty of Engineering Kafrelsheikh University.

Name : Shamia El-sherbiny
Affiliation : Department of Electrical Engineering, College of Engineering Kafrelsheikh, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt
Biography : Was born in Kafelsheikh in 1978. She received the B.Sc and M.Sc from Faculty of Engineering Tanta University. She received Ph.D. from Faculty of Engineering Menoufia University in 2014. She is currently a Lecture in the Department of Electrical Engineering Faculty of Engineering Kafrelsheikh University.

References

[1] M. Barakat, C. Eder, and T. Hanke, An Introduction to Cryptography, 2nd ed. Kaiserslautern, Germany: University of Kaiserslautern, 2018.
[2] R. L. Rivest, A. Shamir, and L. Adleman, “A method for obtaining digital signatures and public-key cryptosystems,” Communications of the ACM, vol. 21, no. 2, pp. 120-126, 1978.
[3] J. I. Okonkwo, G. O. Ozor, and F. A. Okoye, “Performance analysis of RSA algorithm for audio data security in communication networks,” International Journal of Latest Technology in Engineering, Management & Applied Science, vol. 8, no. 9, pp. 48-52, 2019.
[4] H. M. El Bakry, A. E. Taki El Deen, and A. H. El Tengy, “Implementation of an encryption scheme for voice calls,” International Journal of Computer Applications, vol. 144, no. 2, pp. 24-27, 2016.
[5] C. Intila, B. Gerardo, and R. Medina, “A study of public key ‘e’ in RSA algorithm,” IOP Conference Series: Materials Science and Engineering, vol. 482, no. 1, article no. 012016, 2019. https://doi.org/10.1088/1757-899x/482/1/012016
[6] R. M. Pir, “Security improvement and speed monitoring of RSA algorithm,” International Journal of Engineering Development and Research, vol. 4, no. 1, pp. 195-200, 2016.
[7] L. P. Saikia, “Simulation and analysis of modified RSA cryptographic algorithm using five prime numbers,” International Journal on Recent and Innovation Trends in Computing and Communication, vol. 5, no. 6, pp. 224-228, 2017.
[8] A. Mahdi, A. K. Jawad, and S. S. Hreshee, “Digital chaotic scrambling of voice based on duffing map,” International Journal of Information and Communication Sciences, vol. 1, no. 2, pp. 16-21, 2016.
[9] S. Priyanka and B. Hemalatha, “Speech data encryption and decryption using elliptic curve cryptography,” International Journal of Research in Computer Science, vol. 3, no. 1, pp. 48-53, 2016.
[10] M. F. Abd Elzaher, M. Shalaby, and S. H. El Ramly, “Securing modern voice communication systems using multilevel chaotic approach,” International Journal of Computer Applications, vol. 135, no. 9, pp. 17-21, 2016.
[11] A. Ghasemzadeh and E. Esmaeili, “A novel method in audio message encryption based on a mixture of chaos function,” International Journal of Speech Technology, vol. 20, no. 4, pp. 829-837, 2017.
[12] O. M. Al-Hazaimeh, “A new dynamic speech encryption algorithm based on Lorenz chaotic map over internet protocol,” International Journal of Electrical and Computer Engineering, vol. 10, no. 5, article no. 4824-4834, 2020. https://doi.org/10.11591/ijece.v10i5.pp4824-4834
[13] F. J. Farsana, V. R. Devi, and K. Gopakumar, “An audio encryption scheme based on Fast Walsh Hadamard Transform and mixed chaotic keystreams,” Applied Computing and Informatics, 2020. https://doi.org/10.1016/j.aci.2019.10.001
[14] P. K. Naskar, S. Paul, D. Nandy, and A. Chaudhuri, “DNA encoding and channel shuffling for secured encryption of audio data,” Multimedia Tools and Applications, vol. 78, no. 17, pp. 25019-25042, 2019.
[15] X. Wang and Y. Su, “An audio encryption algorithm based on DNA coding and chaotic system,” IEEE Access, vol. 8, pp. 9260-9270, 2019.
[16] Z. N. Al-kateeb and S. J. Mohammed, “A novel approach for audio file encryption using hand geometry,” Multimedia Tools and Applications, vol. 79, no. 27, pp. 19615-19628, 2020.
[17] Z. Chang and M. Wozniak, “Encryption technology of voice transmission in mobile network based on 3DES-ECC algorithm,” Mobile Networks and Applications, vol. 25, no. 6, pp. 2398-2408, 2020.
[18] P. Sathiyamurthi and S. Ramakrishnan, “Speech encryption algorithm using FFT and 3D-Lorenz–logistic chaotic map,” Multimedia Tools & Applications, vol. 79, no. 25-26, pp. 17817-17835, 2020.
[19] Q. Y. Zhang, L. Zhou, T. Zhang, and D. H. Zhang, “A retrieval algorithm of encrypted speech based on short-term cross-correlation and perceptual hashing,” Multimedia Tools and Applications, vol. 78, no. 13, pp. 17825-17846, 2019.
[20] R. I. Abdelfatah, “Audio encryption scheme using self-adaptive bit scrambling and two multi chaotic-based dynamic DNA computations,” IEEE Access, vol. 8, pp. 69894-69907, 2020.
[21] P. C. Loizou, “Speech quality assessment,” in Multimedia Analysis, Processing and Communications. Heidelberg, Germany: Springer, 2011, pp. 623-654.
[22] H. R. Hashim and I. A. Neamaa, “Image encryption and decryption in a modification of ElGamal cryptosystem in MATLAB,” International Journal of Sciences: Basic and Applied Research, vol. 14, no. 2, pp. 141-147, 2014.
[23] R. Mohan, H. L. Dhruw, and Raghvendra, “An effective image encryption based on the combination of scan and ElGamal method,” International Journal of Engineering and Computer Science, vol. 4, no. 5, pp. 11793-11796, 2015.
[24] O. A. Imran, S. F. Yousif, I. S. Hameed, W. N. A. D. Abed, and A. T. Hammid, “Implementation of El-Gamal algorithm for speech signals encryption and decryption,” Procedia Computer Science, vol. 167, pp. 1028-1037, 2020.
[25] S. F. Yousif, “Secure voice cryptography based on Diffie-Hellman algorithm,” IOP Conference Series: Materials Science and Engineering, vol. 1076, no. 1, article no. 012057, 2021. https://doi.org/10.1088/1757-899x/1076/1/012057
[26] M. M. Rahman, T. K. Saha, and M. A. A. Bhuiyan, “Implementation of RSA algorithm for speech data encryption and decryption,” International Journal of Computer Science and Network Security, vol. 12, no. 3, pp. 74-82, 2012.
[27] M. I. Khalil, “Real-time encryption/decryption of audio signal,” International Journal of Computer Network and Information Security, vol. 8, no. 2, pp. 25-31, 2016.
[28] S. F. Yousif, “Encryption and decryption of audio signal based on RSA algorithm,” International Journal of Engineering Technologies and Management Research, vol. 5, no. 7, pp. 57-64, 2018.
[29] P. Kumari, U. Kumar, and S. K. Singh, “Dual-layer video encryption using RSA and ECC algorithm,” International Journal of Scientific and Research Publications, vol. 6, no. 7, pp. 620-625, 2016.
[30] S. N. Sayyad, P. S. Sutar, R. S. Pise, V. H. Raut, and C. V. Nalawade, “Dual-layer video encryption & decryption using RSA algorithm,” International Journal of Innovative Research in Computer and Communication Engineering, vol. 5, no. 4, pp. 7661-7668, 2017.
[31] E. J. Sharma and J. Rani, “An efficient hybrid approach for secure speech cryptography,” International Journal of Computer Science and Mobile Computing, vol. 6, no. 1, pp. 23-29, 2017.
[32] S. Hashemi, M. A. Pourmina, S. Mobayen, and M. R. Alagheband, “Multiuser wireless speech encryption using synchronized chaotic systems,” International Journal of Speech Technology, vol. 24, pp. 651-663, 2021.
[33] A. S. Hameed, “Speech compression and encryption based on discrete wavelet transform and chaotic signals,” Multimedia Tools and Applications, vol. 80, no. 9, pp. 13663-13676, 2021.
[34] G. Kaur, K. Singh, and H. S. Gill, “Chaos-based joint speech encryption scheme using SHA-1,” Multimedia Tools and Applications, vol. 80, no. 7, pp. 10927-10947, 2021.
[35] N. F. Soliman, Z. Mostfa, F. E. Abd El-Samie, and M. I. Abdalla, “Performance enhancement of speaker identification systems using speech encryption and cancelable features,” International Journal of Speech Technology, vol. 20, no. 4, pp. 977-1004, 2017.
[36] H. N. Abdullah, S. S. Hreshee, and A. K. Jawad, “Design of efficient noise reduction scheme for secure speech masked by chaotic signals,” Journal of American Science, vol. 11, no. 7, pp. 49-55, 2015.
[37] H. B. A. Wahab and S. I. Mahdi, “Modify speech cryptosystem based on shuffling overlapping blocks technique,” International Journal of Emerging Trends & Technology in Computer Science, vol. 4, no. 2, pp. 70-75, 2015.
[38] S. N. Al Saad and E. Hato, “A speech encryption based on chaotic maps,” International Journal of Computer Applications, vol. 93, no. 4, pp. 19-28, 2014.
[39] A. H. Khaleel and I. Q. Abduljaleel, “A novel technique for speech encryption based on k-means clustering and quantum chaotic map,” Bulletin of Electrical Engineering and Informatics, vol. 10, no. 1, pp. 160-170, 2021.
[40] H. K. Kate, J. Razmara, and A. Isazadeh, “A novel fast and secure approach for voice encryption based on DNA computing,” 3D Research, vol. 9, article no. 17, 2018. https://doi.org/10.1007/s13319-018-0167-x

Eman Abouelkheir1,2,* and Shamia El-Sherbiny1, Enhancement of Speech Encryption/Decryption Process Using RSA Algorithm Variants, Article number: 12:06 (2022) Cite this article 2 Accesses