The following is a list of some generally applicable basic standards that are relevant to the requirements of this Standard.
ANSI/TIA-464-C-2002 – Requirements for PBX Switching Equipment.
ANSI/TIA-464-C-1-2004 – Requirements for PBX Switching Equipment, Addendum.
ANSI/TIA /EIA-810-AB-2000 2006 – Telecommunications –Telephone Terminal Equipment - Transmission Requirements for Narrowband Voice over IP and Voice over PCM Digital Wireline Telephones.
ANSI/T1.508-2004 – Loss Plan for Evolving Digital Networks.
TIA-912-AB-2004 2009 – Telecommunications - IP Telephony Equipment - Voice Gateway Transmission Requirements.
TIA-920.110-A-201002 – Telecommunications –Telephone Terminal Equipment - Transmission Requirements for Wideband Digital Wireline Telephones.
ITU-T Recommendation G.101 (2003) – The transmission plan.
ITU-T Recommendation P.58 (1996) – Head and Torso Simulator for Telephonometry
(Informative) – BNR Study
This annex is a version of an internal memo on the results for preferred listening levels for handsfree terminals (speakerphones), completed in the BNR (now Nortel) Subjective Assessment Lab in August 1986. This study examined user preference for speakerphone receive output levels in quiet and with several levels of room noise. Recommendations for speakerphone nominal level were given based on user response to specific levels across a range of room noise values.
Nortel provided this data in support of the development of TIA-470.120-C, These results were originally contributed to IEEE in 1995 in support of similar work, and they are consistent with TIA and ITU recommendations for the electro-acoustic levels of speakerphones.
Objectives
-
To determine preferred listening levels for loudspeaker listening.
-
To examine the relationship between preferred listening .level and room noise.
Apparatus
A loudspeaker was placed on the table 31.5 cm behind the standard handsfree position, and tilted 10 degrees upwards. Acoustically
transparent foam hung, from ceiling to table height, in front of the loudspeaker. A disconnected Northern Telecom Companion 4 terminal was placed in the standard experimental position (40 cm in front of subject). The speech samples were bandpass filtered at 500-3500 Hz to simulate the telephony speech band and equalized (per sentence) for signal level. Speech levels were measured using a B&K sound pressure meter that fed into a BT Speech Voltmeter SV6. Measurements were made using a 60 second sample of recorded speech (same samples and talkers used in the study). Levels were measured from the standard handsfree measuring position, 40 cm in front, and 30 cm above the handsfree terminal. The speaker amplifier was calibrated to produce an unattenuated speech level of 89.5 dB SPL. A programmable attenuator was used to control the speech level heard on each trial.
Procedure
Subjects sat facing the loudspeaker, with the chair placed at a natural distance from the table. A speech sample consisting of one sentence was played through the loudspeaker. Although speech samples were presented through the loudspeaker, subjects reported that speech appeared to come from the handsfree terminal.
Subjects were asked to rate whether speech samples were "Too Soft' , "OK", or "Too Loud", and indicate their response using the button box. Twelve trials were randomly presented at each of the ten speech levels shown in the table below. Three blocks of trials were given to each subject, one at each of the three room noise levels: 35, 45, and 55 dBA. Blocks occurred in random order. The 360 trials took approximately 20 minutes to complete. Results below are based on responses from 20 subjects.
Preliminary Results
As room noise increases, preferred listening level increases. At 35 dBA the preferred listening level is 70 dB SPL, at 45 dBA it is 73 dB SPL, and at 55 dBA preferred listening level is 76 dB SPL. The rate of increase of preferred listening level with increasing room noise is approximately .3 dB/dB, which is in line with Gardner's (1969) and Ericsson's (1977) results.
There appears to be a threshold between 79 and 82 dBSPL where speech is considered too loud, regardless of the room noise level.
Compression of the acceptable range of speech levels as room noise increases, as reported by Gardner and others, is not readily apparent in the present results. In all room noise conditions, there are at least three speech levels where %OK is above 80%. Substantial compression may not occur until higher room noise levels, or maybe the compression information is lost when data is averaged.
Recommendations
A good compromise for nominal level appears to be 73 dB SPL, since it gives over 90% acceptance (94,97,95) at all three noise levels. An acceptable range for volume would be 61 to 82 dB SPL; 64% found 61 dB to be too low even in the quiet room, and 55% found 82 dB to be too loud in the noisiest room. The step size should be smaller than 3 dB, since a single 3 dB step can change opinion by over 40%.
It should be remembered that the present results were obtained in the large sound room, which is not a very reverberant room. Since sound is absorbed readily in this room, subjects may prefer higher listening levels in this room than in a more reverberant room. This should be taken into account when generalizing these results to other environments.
Table of Results
The following are the acceptance of speech levels for
three levels of room noise
Table FG.1 – Room noise 35 dBA User Acceptance Levels
Room Noise = 35dBA
|
|
|
N=20
|
Speech Level (dBSPL)
|
% Too Low
|
% OK
|
% Too High
|
85
|
0
|
2
|
98
|
82
|
0
|
12
|
87
|
79
|
0
|
37
|
63
|
76
|
0
|
74
|
25
|
73
|
0
|
94
|
5
|
70
|
0
|
99
|
0
|
67
|
7
|
93
|
0
|
64
|
34
|
64
|
1
|
61
|
64
|
35
|
0
|
58
|
87
|
12
|
0
|
Table
FG.2 – Room noise 45 dBA User Acceptance Levels
Room Noise = 45dBA
|
|
|
N=20
|
Speech Level (dBSPL)
|
% Too Low
|
% OK
|
% Too High
|
85
|
0
|
5
|
94
|
82
|
1
|
24
|
74
|
79
|
0
|
61
|
38
|
76
|
1
|
88
|
10
|
73
|
1
|
97
|
1
|
70
|
4
|
95
|
0
|
67
|
25
|
74
|
0
|
64
|
66
|
33
|
0
|
61
|
86
|
13
|
0
|
58
|
97
|
2
|
0
|
Table
FG.3 – Room noise 55 dBA User Acceptance Levels
Room Noise = 55dBA
|
|
|
N=20
|
Speech Level (dBSPL)
|
% Too Low
|
% OK
|
% Too High
|
85
|
0
|
13
|
86
|
82
|
0
|
44
|
55
|
79
|
0
|
75
|
25
|
76
|
3
|
96
|
1
|
73
|
4
|
95
|
0
|
70
|
20
|
80
|
0
|
67
|
60
|
39
|
0
|
64
|
88
|
12
|
0
|
61
|
97
|
3
|
0
|
58
|
100
|
0
|
0
|
(Informative) – Items for Future Consideration
The following is a list of items for consideration in a future revision of this document.
Voice Switching Performance
Speakerphone voice switching performance can be thought of part art and part science with a number of inter-related variables. When the number of variables exceeds seven the design becomes an art. Speakerphone deals with many more than seven and many of them are not independent variables involving background noise,
switching speeds, integration times, frequency content, volume controls, voice levels, voice cadence etc. There are a variety of good design solutions that work but juggle the parameters differently.
A speakerphone must perform over a wide number of conversation dynamics among a variety of interlocutors. The speaking style, cadence, pitch, loudness, interruption initiation, even politeness of each person comes into play as well as background noise, etc. Applying a few, or even considerably more than a few, test signals to the device doesn’t guarantee that a bad switching design fails, or that a good one passes. The methods described in IEEE Std 269 & IEEE Std 1329 are complicated to implement but probably not complicated enough to do a reliable job. They may be great development or screening tools once the parameters are set for a specific design implementation.
At a minimum a correlation test with subjective tests using a variety of subjects should be conducted to verify voice switching and this is outside the scope of this document.
Single Talk Performance
Send Threshold Level
Send Build-Up Time
Send Hang-Over Time
-
Receive Build-Up Time
Receive Threshold Level
Receive Hang-Over Time
Receive Noise Guard
Attenuation Range
Double-Talk Performance
Receive to Send Switching Time (Send Switching Time)
Receive to Send Switching Threshold Level
Send to Receive Switching Time (Receive Switching Time)
Send to Receive Switching Threshold Level
Receive to Send Take-Over Time (Send Take-Over Time)
Send to Receive Take-Over Time (Receive Take-Over Time)
Acoustic Echo Canceller Performance
Round Trip Echo Path Delay
TCL Single Talk
Convergence Time
TCL Temporally Weighted Coupling Loss - Double Talk
-
Receive Speech Attenuation During Double Talk
Send Speech Front-end Clipping Time During Double Talk
Receive Speech Front-end Clipping Time During Double Talk
Acoustic Stability (TCL: Terminal Coupling Loss)
Stability Related to Low Level Stimulus
Dynamic Send Background Noise Suppression
Switching operation between speakerphone & handset mode
Send Level and Send Level Directionality
Consideration should be given to replacing the Send Loudness Rating (SLR) and Send Loudness Rating Directionality (SLRD) requirements in 4.4.1.2 and 4.4.1.3 with Send Level and Send Level Directionality (SLD) requirements in a manner analogous to the Receive Level and Receive Level Directionality (RLD) requirements in 4.4.2.1 and 4.4.2.2. The following text has been proposed.
Send Level
The CPE is expected to encounter talkers with a wide dynamic range of input speech levels. A microphone volume control (manual or automatic or both) may be present providing a means to compensate for this variable input level. Different volume control settings may be used to meet the requirements at the different test loops.
Requirement
To ensure adequate gain, the CPE shall meet the network level requirements in Table H-1.
Table G.1H-1 – Speakerphone Send Level Limits
Condition
|
Input Level
|
Loop
|
Network Level Target
|
Network Level Range
|
High Level, short loop
|
+5 dBPa
|
0 km
|
-10 dBm
|
-18 to -8 dBm
|
Nominal Level, mid loop
|
-5 dBPa
|
2.7 km
|
-20 dBm
|
-25 to -15 dBm
|
Low Level, long loop
|
-15 dBPa
|
4.6 km
|
-30 dBm
|
-32 to -22 dBm
|
NOTES:
The test signal input levels were derived as follows:
-
High Level (+5 dBPa): Highest level possible for using P.50 without the mouth simulator clipping the test signal and represents approximately two standard deviations from the mean Near-End ASL.
-
Nominal Level (-5 dBPa): Represents the mean Near-End Active Speech Level (ASL) at the CPE recorded in ???.
-
Low Level (-15 dBPa): Represents approximately two‘x’ standard deviations from the mean Near-End Active Speech Level (ASL) at the CPE.
-
The requirements above would allow acceptable network levels with average room noise levels. For higher room noise levels, a higher talker level (i.e. +10 dBPa) would allow for a better far-end listening experience.
Method of Measurement
-
Configure the CPE and test equipment as shown in Figure 1 and Figure 3.
-
The test signal for this test is the ITU-T P.50 male artificial speech signal.
-
Set the mouth simulator for a level at MRP per the levels in G .1Table H-1:
-
The measurement is be made over a minimum range of 100 Hz through 8000 Hz using real-time analysis with 1/3 octave or smaller bands averaged over the entire duration of the test signal.
-
The 100 Hz through 8000 Hz measurement result is then power summed and expressed in terms of dBm.
Send Level Directionality (SLD)
Requirement
The SLD values should be within the limits given in G .2Table H-2.
Table G.2H-2 – Send Level Directionality Limits.
Angle
Degrees CCW
|
SLD
(dB)
|
30
|
+/- 3
|
60
|
< -5
|
300
|
< -5
|
330
|
+/- 3
|
NOTES:
-
The intent is to have consistent level when the talker is +/- 30 degrees from the center axis.
-
The intent is to ensure that there is no more than 5dB change when the talker is +/- 60 degrees from the center axis.
Method of Measurement
-
The Send Level is first measured according to I.5.1.2 at the 0 degree angle at -5 dBPa with a 2.7 km artificial line.
-
The speakerphone is then rotated about its physical center 30 degrees counter-clockwise. Take care to keep the MRP of the artificial mouth at a constant distance from the physical center of the speakerphone, see Figure G .1Figure 8. Repeat the Send Level measurement.
-
Successively rotate the speakerphone about its physical center to the 60, 300, and 330 degree position and repeat the Send Level measurement at each position.
-
Calculate the SLD for each angle as the difference between the Send Level at that angle and the Send Level at 0 degrees.
Editorial Note: The title of Figure 8 will need to be changed from “Speakerphone Send LR Directionality Measurement Setup” to “Speakerphone Send Level Directionality Measurement Setup.”
NOTES:
-
The Speakerphone setup at the 0 Degree position is using the Reference Triangle.
-
For the other test positions the Speakerphone is rotated around its the physical center.
-
Test table is at least 1 meter square.
Figure G.1 – Speakerphone Send Level Directionality Measurement Setup