使用 SouceDataLine 播放生成的声音是模糊的
Generated sound played with SouceDataLine is fuzzy
我正在尝试实时生成一组同步音调。但是程序产生的所有声音都是 "fuzzy",或者有 "static",甚至在背景中听起来像 "squealing"。这在低音调的声音中尤其明显。
这是代码:
static final long bufferLength = 44100;
static final AudioFormat af = new AudioFormat(bufferLength, 8, 1, true, false);
static boolean go = true; //to be changed somewhere else
static void startSound(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
finalBuffer[j] = (byte)(buf[j]/hertz.length);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
//play some deep example tones
startSound(new double[]{65.4064, 58.2705, 48.9995});
我试过录制从这个程序输出的声音,波形确实有点参差不齐。但是当我直接从程序中打印出生成的波浪时,它们看起来非常平滑。我发出的声音似乎与扬声器发出的声音不匹配。谁能发现我做错了什么?
根据我的评论,我认为您听到 quantization error 是因为 8 位音频,您应该切换到 16 位。量化误差有时被称为噪声,但它是一种平方谐波失真,是您听到的细微泛音的来源。
8 位有时对于语音之类的东西是可以接受的,因为它听起来更像噪音。纯音失真更明显。
我把你的代码变成了一个粗略的 MCVE 来证明差异。
class SoundTest {
static final int bufferLength = 44100;
static final AudioFormat af8 = new AudioFormat(bufferLength, 8, 1, true, false);
static final AudioFormat af16 = new AudioFormat(bufferLength, 16, 1, true, false);
static volatile boolean go = true; //to be changed somewhere else
static void startSound8(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af8);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
// double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
double volume = 64;
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
finalBuffer[j] = (byte)(buf[j]/hertz.length);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
synchronized (SoundTest.class) {
SoundTest.class.notifyAll();
}
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
static void startSound16(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af16);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
// double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
double volume = 16384;
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length * 2];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
int a = (int) (buf[j] / hertz.length);
finalBuffer[j * 2] = (byte) a;
finalBuffer[(j * 2) + 1] = (byte) (a >>> 8);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
synchronized (SoundTest.class) {
SoundTest.class.notifyAll();
}
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
static void playTone(final double hz, final boolean fewBits) {
go = true;
new Thread() {
@Override
public void run() {
if (fewBits) {
startSound8(new double[] {hz});
} else {
startSound16(new double[] {hz});
}
}
}.start();
try {
Thread.sleep(5000);
} catch (InterruptedException x) {
x.printStackTrace();
} finally {
go = false;
synchronized (SoundTest.class) {
try {
SoundTest.class.wait();
} catch (InterruptedException x) {
x.printStackTrace();
}
}
}
}
public static void main(String[] args) {
playTone(220, true);
playTone(220, false);
}
}
我讨论了用于打包 16 位字节数组的位操作的概念 here 并且有示例代码。
还值得一提的是,如果专业应用程序出于某种原因需要使用 8 位,它可能会在量化之前添加 dither,这听起来比纯量化误差要好。 (对于 16 位也是如此,但是 16 位的量化误差是听不见的,除非它被累积。)
我正在尝试实时生成一组同步音调。但是程序产生的所有声音都是 "fuzzy",或者有 "static",甚至在背景中听起来像 "squealing"。这在低音调的声音中尤其明显。 这是代码:
static final long bufferLength = 44100;
static final AudioFormat af = new AudioFormat(bufferLength, 8, 1, true, false);
static boolean go = true; //to be changed somewhere else
static void startSound(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
finalBuffer[j] = (byte)(buf[j]/hertz.length);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
//play some deep example tones
startSound(new double[]{65.4064, 58.2705, 48.9995});
我试过录制从这个程序输出的声音,波形确实有点参差不齐。但是当我直接从程序中打印出生成的波浪时,它们看起来非常平滑。我发出的声音似乎与扬声器发出的声音不匹配。谁能发现我做错了什么?
根据我的评论,我认为您听到 quantization error 是因为 8 位音频,您应该切换到 16 位。量化误差有时被称为噪声,但它是一种平方谐波失真,是您听到的细微泛音的来源。
8 位有时对于语音之类的东西是可以接受的,因为它听起来更像噪音。纯音失真更明显。
我把你的代码变成了一个粗略的 MCVE 来证明差异。
class SoundTest {
static final int bufferLength = 44100;
static final AudioFormat af8 = new AudioFormat(bufferLength, 8, 1, true, false);
static final AudioFormat af16 = new AudioFormat(bufferLength, 16, 1, true, false);
static volatile boolean go = true; //to be changed somewhere else
static void startSound8(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af8);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
// double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
double volume = 64;
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
finalBuffer[j] = (byte)(buf[j]/hertz.length);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
synchronized (SoundTest.class) {
SoundTest.class.notifyAll();
}
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
static void startSound16(double[] hertz) {
if (hertz.length == 0) {return;}
try {
SourceDataLine sdl = AudioSystem.getSourceDataLine(af16);
sdl.open();
sdl.start();
int i = 0;
//iterate as long as the sound must play
do {
//create a new buffer
double[] buf = new double[128]; //arbitrary number
final int startI = i;
//iterate through each of the tones
for (int k = 0; k < hertz.length; k++) {
i = startI;
//iterate through each of the samples for this buffer
for (int j = 0; j < buf.length; j++) {
double x = (double)i/bufferLength*hertz[k]*2*Math.PI;
double wave1 = Math.sin(x);
//decrease volume with increasing pitch
// double volume = Math.min(Math.max(300 - hertz[k], 50d), 126d);
double volume = 16384;
buf[j] += wave1*volume;
i++;
if (i == 9999999) { //prevent i from getting too big
i = 0;
}
}
}
final byte[] finalBuffer = new byte[buf.length * 2];
//copy the double buffer into byte buffer
for (int j = 0; j < buf.length; j++) {
//divide by hertz.length to prevent simultaneous sounds
// from becoming too loud
int a = (int) (buf[j] / hertz.length);
finalBuffer[j * 2] = (byte) a;
finalBuffer[(j * 2) + 1] = (byte) (a >>> 8);
}
//play the sound
sdl.write(finalBuffer, 0, finalBuffer.length);
} while (go);
sdl.flush();
sdl.stop();
synchronized (SoundTest.class) {
SoundTest.class.notifyAll();
}
} catch (LineUnavailableException e) {
e.printStackTrace();
}
}
static void playTone(final double hz, final boolean fewBits) {
go = true;
new Thread() {
@Override
public void run() {
if (fewBits) {
startSound8(new double[] {hz});
} else {
startSound16(new double[] {hz});
}
}
}.start();
try {
Thread.sleep(5000);
} catch (InterruptedException x) {
x.printStackTrace();
} finally {
go = false;
synchronized (SoundTest.class) {
try {
SoundTest.class.wait();
} catch (InterruptedException x) {
x.printStackTrace();
}
}
}
}
public static void main(String[] args) {
playTone(220, true);
playTone(220, false);
}
}
我讨论了用于打包 16 位字节数组的位操作的概念 here 并且有示例代码。
还值得一提的是,如果专业应用程序出于某种原因需要使用 8 位,它可能会在量化之前添加 dither,这听起来比纯量化误差要好。 (对于 16 位也是如此,但是 16 位的量化误差是听不见的,除非它被累积。)