SQL Average of Sum of Calculated Attribute

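I am trying to take a per-category average where the rows are first grouped by subcategory and summed. The primary key of the Parent Table is the grouping attribute of the Child Table. The grouping attribute of the Parent Table is neither its primary key nor a column of the Child Table.

Simplified representation: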

select Category, avg(CalculatedSum)
from ParentTable pt
inner join (
    select Subcategory, sum(Quantity * Price) as 'CalculatedSum'
    from ChildTable
    group by Subcategory
    ) ct
on pt.ID = ct.Subcategory
group by Category

The actual SQL is:

select c.CU_AGE_RANGE, count(*) as '# of Customers', avg(SumSales) as 'Avg of SumSales', max([Max of SumSales]) as 'Max of SumSales', min([Min of SumSales]) as 'Min of SumSales'
from Customers c
inner join (
    select CUSTOMER_ID, sum(QTY_SOLD * SALES) as SumSales, max(QTY_SOLD*SALES) as 'Max of SumSales', min(QTY_SOLD*SALES) as 'Min of SumSales'
    from Sales
    where (SALES > 0) and (QTY_SOLD > 0) and (COST > 0)
    Group by CUSTOMER_ID
    ) s
on c.CUSTOMER_ID = s.CUSTOMER_ID
group by c.CU_AGE_RANGE

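I have tried changing the group by clause to various orderings of the category (CU_AGE_RANGE) and the subcategory (CUSTOMER_ID), but I always get the same error.

The error is that the table always shows the SUM of the SUMs (I believe). I assume this is the error because a typical average in the Child Table is 250 to 1000, while Avg(Sum()) returns values that are roughly the number of rows in each category times the expected Sum().

My reputation is too low to post a picture, so please see the comma-separated results table below: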

CU_AGE_RANGE,#_of_Customers,Avg_of_SumSales,Max_of_SumSales,Min_of_SumSales
NULL,125,4261665.306,433460737.7,0.0017
20-29     ,1192,1154040.907,1374037708,0.00025
30-39     ,1902,25429.52329,29426212.64,0.00015
40-49     ,2118,2418.829874,2066725,0.0001
50-59     ,2204,114625.4111,248240261.3,0.00015
60+       ,2135,160156.4341,334617675,0.0005
patrickbig,1,65.5737,12,0.06
Under 19  ,484,1431.262112,92160,0.0001

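I am trying to figure out why AVG(SUM()) appears to return SUM(SUM()). My current hunch is that, because SUM() is a calculated entry, the calculated value gets recomputed according to the grouping in the Parent Table. So it would be: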

Expected:

x * y for each row in Child Table
sum(x * y) for each Subcategory
avg(sum(x * y)) for each Category of Subcategory

QTY_SOLD * SALES for each row in Sales
sum(QTY_SOLD * SALES) for each CUSTOMER_ID
avg(sum(QTY_SOLD * SALES)) for each CU_AGE_RANGE group of CUSTOMER_IDs

Actual:

x * y for each row in Child Table
sum(x * y) for each Subcategory
avg(sum(x * y)) for each Category

avg(sum(QTY_SOLD * SALES)) for each CU_AGE_RANGE

Which equals:

sum(QTY_SOLD * SALES) for each CU_AGE_RANGE
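
To make the difference between the two outcomes concrete, here is a minimal sketch in plain Python. The four sample rows are made up for illustration and are not from the real Sales table. It walks through the three expected steps and also computes the sum of sums, which is larger than the average by a factor of the number of customers in the group.

```python
from collections import defaultdict

# Hypothetical sample rows: (customer_id, age_range, qty_sold, sales)
rows = [
    (1, "20-29", 2, 10.0),
    (1, "20-29", 1, 5.0),
    (2, "20-29", 3, 5.0),
    (3, "30-39", 4, 10.0),
]

# Steps 1 and 2: QTY_SOLD * SALES per row, summed per customer.
sum_per_customer = defaultdict(float)
age_of_customer = {}
for customer_id, age_range, qty, sales in rows:
    sum_per_customer[customer_id] += qty * sales
    age_of_customer[customer_id] = age_range

# Step 3: collect the per-customer sums under each age range.
sums_by_age = defaultdict(list)
for customer_id, total in sum_per_customer.items():
    sums_by_age[age_of_customer[customer_id]].append(total)

# Expected result: average of the per-customer sums in each age range.
avg_of_sums = {age: sum(v) / len(v) for age, v in sums_by_age.items()}
# Suspected result: sum of the per-customer sums in each age range.
sum_of_sums = {age: sum(v) for age, v in sums_by_age.items()}

print(avg_of_sums)
print(sum_of_sums)
```

With two customers in the 20-29 group, the sum of sums is exactly twice the average of sums, which is the kind of gap described above.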

How do I get from the current result (sum per category) to the desired one (per-category average of the per-subcategory sums)?

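Your customer count is off. count(*) counts the rows produced by the join with the sales subquery, not distinct customers. Changing it to count( DISTINCT c.CUSTOMER_ID ) should fix that.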

select c.CU_AGE_RANGE, count( DISTINCT c.CUSTOMER_ID ) as '# of Customers', avg(SumSales) as 'Avg of SumSales', max([Max of SumSales]) as 'Max of SumSales', min([Min of SumSales]) as 'Min of SumSales'
from Customers c
inner join (
    select CUSTOMER_ID, sum(QTY_SOLD * SALES) as SumSales, max(QTY_SOLD*SALES) as 'Max of SumSales', min(QTY_SOLD*SALES) as 'Min of SumSales'
    from Sales
    where (SALES > 0) and (QTY_SOLD > 0) and (COST > 0)
    Group by CUSTOMER_ID
    ) s
on c.CUSTOMER_ID = s.CUSTOMER_ID
group by c.CU_AGE_RANGE
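
Here is a small sketch of when the two counts diverge, using Python's built-in sqlite3 module rather than your actual database. The tables and rows are invented for illustration, and the duplicated customer row is an assumption about what the data might look like. If CUSTOMER_ID is unique in Customers, both counts come out the same.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Customers (CUSTOMER_ID INTEGER, CU_AGE_RANGE TEXT);
    CREATE TABLE Sales (CUSTOMER_ID INTEGER, QTY_SOLD REAL, SALES REAL);
    -- Customer 1 is listed twice on purpose (hypothetical duplicate row).
    INSERT INTO Customers VALUES (1, '20-29'), (1, '20-29'), (2, '20-29');
    INSERT INTO Sales VALUES (1, 2, 100), (1, 1, 50), (2, 3, 200);
""")

query = """
    SELECT c.CU_AGE_RANGE,
           COUNT(*) AS joined_rows,
           COUNT(DISTINCT c.CUSTOMER_ID) AS customers
    FROM Customers c
    INNER JOIN (
        SELECT CUSTOMER_ID, SUM(QTY_SOLD * SALES) AS SumSales
        FROM Sales
        GROUP BY CUSTOMER_ID
    ) s ON c.CUSTOMER_ID = s.CUSTOMER_ID
    GROUP BY c.CU_AGE_RANGE
"""
result = conn.execute(query).fetchall()
print(result)  # one row per age range: (range, joined_rows, customers)
```

In this sample the join yields three rows for two distinct customers, so any aggregate over the joined rows, including avg(SumSales), weights the duplicated customer twice.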

Consider the subquery first:

select Subcategory, sum(Quantity * Price) as 'CalculatedSum'
from ChildTable
group by Subcategory

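Each record of the resulting relation represents the aggregate for one Subcategory. avg(CalculatedSum) should therefore produce the average of the CalculatedSum values. Try computing sum(CalculatedSum) as well and see whether the two differ.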
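
The suggested check can be sketched as follows, using Python's built-in sqlite3 module and a tiny invented data set in place of the real tables (the rows are assumptions for illustration only). It runs the simplified query with both aggregates side by side.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE ParentTable (ID INTEGER PRIMARY KEY, Category TEXT);
    CREATE TABLE ChildTable (Subcategory INTEGER, Quantity REAL, Price REAL);
    INSERT INTO ParentTable VALUES (1, 'A'), (2, 'A'), (3, 'B');
    INSERT INTO ChildTable VALUES
        (1, 2, 100), (1, 1, 50),   -- subcategory 1 sums to 250
        (2, 3, 200),               -- subcategory 2 sums to 600
        (3, 1, 400);               -- subcategory 3 sums to 400
""")

query = """
    SELECT pt.Category,
           AVG(ct.CalculatedSum) AS avg_of_sums,
           SUM(ct.CalculatedSum) AS sum_of_sums
    FROM ParentTable pt
    INNER JOIN (
        SELECT Subcategory, SUM(Quantity * Price) AS CalculatedSum
        FROM ChildTable
        GROUP BY Subcategory
    ) ct ON pt.ID = ct.Subcategory
    GROUP BY pt.Category
    ORDER BY pt.Category
"""
result = conn.execute(query).fetchall()
print(result)
```

On this sample, category A gets an average of 425 and a sum of 850, so the outer avg really does average the per-subcategory sums. If the same comparison on your data shows two different columns, the query shape is doing what you want and the large averages likely come from the data itself.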