H. M. ZHOU ET AL.
618
2.4. Statistical Analysis of Categories of
Indicators
Statistical analysis indicator refers to the statistical anal-
ysis of the previous two categories of indicator s during a
period of time.
1) Total statistics:
The purpose is that to stat the cumulative quantity of
some indicators over a period of time. Such as: failure
time (seconds) of frequency, the unqualified cumulative
number of days and failure time, the maintenance ticket
number and the operating ticket number performed
monthly, power grid network losses, unplanned generator
outage times, and the frequency of fault trip of power
lines, bus, together (main).
2) Mean / extreme value statistics:
The purpose is that to stat the maxi mum, the minimum
or the average indicators over a period of time.
Such as: the largest number of operating ticket in sin-
gle-day, the 96-points load characteristic curve, together
(main) maximum load change rate, water consumption
rate of direct transfer hydropower plants, thermal power
plant coal consumption rate of direct transfer, high- vol-
tage transmission network net loss, 220 kV and more
rapid protection line fault removal rate, times of power
cuts and the average daily capacity and so on.
3) Percentage statistics:
The purpose is that to stat the ratio between the two
types of indicators of the total in a period of time. Such
as: the correct rate of the moves of security control de-
vice, the tie-line bias and frequency control passing rate,
voltage passing rate, load forecasting accuracy, the pass-
ing rate of the low-frequency load shedding control ca-
pacity, the operation rate of unit PSS, the operation rate
of unit AGC, the qualified rate of the operating vote.
3. Scheduling Operaion Analysis Data
Warehouse Platform
3.1. Scheduling Integrated Data Warehouse
Platform
All the available data relative to the supplier’s bidding
can be divided into three classes:
Data Hub is an onlin e system that is designed for Stats
analysis and decision support applications. It could meet
the decision support and online analytical applications
require all. This data is called data warehouse platform.
The establishment of dispatching and comprehensive
analysis platform for data mining is responsible for col-
lecting all kinds of the required scheduling run data indi-
cators in regular time.
1) The basic information of network parameters, pow-
er plants and units.
2) Power generation, by the power, the load plan and
actual data.
3) Power line trend data, the node voltage data.
4) Local power plants planned and actual data.
5) AGC unit and various indicators of data assessment.
6) Market transactions and load forecast data.
7) Reporting of transactions, transaction data.
8) All kinds of i nf ormation in regional p ower market .
9) Regional power grid parameters and inter-provin-
cial trade data.
Data warehouse is a new application of a database
technology, and so far, the data warehouse is a relational
database management system to manage the data.
4. Olap Multidimensional Data Analysis and
Reporting System
Data warehouse contains a lot of data extracted from a
number of databases, but these data could play its proper
value only by using the right tools and being used effec-
tively.
The methods of data mining for the data in the data
warehouse to be analyzed include online analytical proc-
essing (OLAP), association rule mining, decision tree
analysis, cluster analysis and other. Shanghai power sys-
tem mainly uses online analytical processing (OLAP)
technology.
4.1. Dispatching Multi-dimensional Analysis and
Processing
Multi-dimensional data ex traction and OLAP data analy-
sis could be done by the use of all kinds of data in the
dispatching data warehouse. As shown in Figur e 1:
1) Time dimension: Data classification according to
year, quart e r, month , week, day, ho ur and minute.
2) Period dimension: Data classification according to
the three period-peak, trough , and waist load.
3) Regional dimension: Data classification according
to different regions.
4) Plant dimensions: Data classification according to
power plant, the type (bid/peaking/FM/self) and Power
Generatio n Gr oup.
5) Unit dimensions: Data classification depending on
the capacity of the unit.
6) Line dimensions: data analysis depending on the
line, the line type.
7) Substation dimensions: Data classification accord-
ing to the type of substation and sub station.
8) Weather dimensions: Data classification according
to the temperature, humidity, the sunny weather or the
cloudy weather and other standard.
9) Custom dimension: We could create new data clas-
sification flexibly according to the needs of scheduling
Analysis, such as the level of the plant load factor, power
frequency, peak and valley levels and any other levels of
parameters.
Copyright © 2013 SciRes. EPE