WindTurbineClassification
Specification of 'normal' wind turbine operating behaviour for rapid anomaly detection.
https://github.com/nmstreethran/WindTurbineClassification
Category: Renewable Energy
Sub Category: Wind Energy
Keywords
energy jupyter-notebook python renewable-energy wind-turbines
Last synced: about 21 hours ago
JSON representation
Repository metadata
My master's dissertation on wind turbine fault prediction using machine learning
- Host: GitHub
- URL: https://github.com/nmstreethran/WindTurbineClassification
- Owner: nmstreethran
- License: mit
- Created: 2018-07-29T23:36:18.000Z (over 6 years ago)
- Default Branch: current
- Last Pushed: 2024-03-17T15:04:11.000Z (about 1 year ago)
- Last Synced: 2025-04-25T12:45:36.557Z (2 days ago)
- Topics: energy, jupyter-notebook, python, renewable-energy, wind-turbines
- Language: Python
- Homepage:
- Size: 54.6 MB
- Stars: 56
- Watchers: 3
- Forks: 11
- Open Issues: 0
- Releases: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
README.md
WindTurbineClassification
Specification of 'normal' wind turbine operating behaviour for rapid anomaly detection: through the use of machine learning algorithms
by Nithiya Streethran ([email protected])
This work is derived from my dissertation for the degree of Master of Science (MSc) in Renewable Energy Engineering at Heriot-Watt University, which was completed during a technical placement at Natural Power between May and August 2017.
Unfortunately, the datasets are proprietary industry data and I do not own the rights to distribute them to the public. Please do not contact me to request the datasets.
Abstract
Maximising the economic effectiveness of a wind farm is essential in making wind a more economic source of energy. This effectiveness can be increased through the reduction of operation and maintenance costs, which can be achieved through continuously monitoring the condition of wind turbines. An alternative to expensive condition monitoring systems, which can be uneconomical especially for older wind turbines, is to implement classification algorithms on supervisory control and data acquisition (SCADA) signals, which are collected in most wind turbines. Several publications were reviewed, which were all found to use separate algorithms to predict specific faults in advance. In reality, wind turbines tend to have multiple faults which may happen simultaneously and have correlations with one another. This project focusses on developing a methodology to predict multiple wind turbine faults in advance simultaneously by implementing classification algorithms on SCADA signals for a wind farm with 25 turbines rated at 2,500 kW, spanning a period of 30 months. The data, which included measurements of wind speed, active power and pitch angle, was labelled using corresponding downtime data to detect normal behaviour, faults and varying timescales before a fault occurs. Three different classification algorithms, namely decision trees, random forests and k nearest neighbours were tested using imbalanced and balanced training data, initially to optimise a number of hyperparameters. The random forest classifier produced the best results. Upon conducting a more detailed analysis on the performance of specific faults, it was found that the classifier was unable to detect the varying timescales before a fault with accuracy comparable to that of normal or faulty behaviour. This could have been due to the SCADA data, which are used as features, being unsuitable for detecting the faults, and there is potential to improve this by balancing only these classes.
Keywords: wind turbine, classification algorithm, SCADA, fault detection, condition monitoring
Scripts
Source | Output | |
---|---|---|
Process SCADA and downtime data | ||
Downtime categories | ||
Merge SCADA and downtime data | ||
Power curves for all turbines |
License
Unless otherwise stated:
- Code and scripts are licensed under the MIT License.
- Content, images, and documentation are licensed under a Creative Commons Attribution 4.0 International (CC-BY-4.0) License.
Project badges are generated using Shields.io and Simple Icons.
Owner metadata
- Name: Nithiya Streethran
- Login: nmstreethran
- Email:
- Kind: user
- Description:
- Website: https://nithiya.gitlab.io
- Location: Ireland
- Twitter:
- Company:
- Icon url: https://avatars.githubusercontent.com/u/31223306?u=4e54b12a41e730418028bcc10db0f652eaa8d58e&v=4
- Repositories: 14
- Last ynced at: 2025-04-19T04:22:01.054Z
- Profile URL: https://github.com/nmstreethran
GitHub Events
Total
- Watch event: 4
Last Year
- Watch event: 4
Committers metadata
Last synced: 6 days ago
Total Commits: 96
Total Committers: 1
Avg Commits per committer: 96.0
Development Distribution Score (DDS): 0.0
Commits in past year: 1
Committers in past year: 1
Avg Commits per committer in past year: 1.0
Development Distribution Score (DDS) in past year: 0.0
Name | Commits | |
---|---|---|
nmstreethran | n****n@g****m | 96 |
Committer domains:
Issue and Pull Request metadata
Last synced: 2 days ago
Total issues: 0
Total pull requests: 1
Average time to close issues: N/A
Average time to close pull requests: 10 minutes
Total issue authors: 0
Total pull request authors: 1
Average comments per issue: 0
Average comments per pull request: 0.0
Merged pull request: 1
Bot issues: 0
Bot pull requests: 0
Past year issues: 0
Past year pull requests: 0
Past year average time to close issues: N/A
Past year average time to close pull requests: N/A
Past year issue authors: 0
Past year pull request authors: 0
Past year average comments per issue: 0
Past year average comments per pull request: 0
Past year merged pull request: 0
Past year bot issues: 0
Past year bot pull requests: 0
Top Issue Authors
Top Pull Request Authors
- nmstreethran (1)
Top Issue Labels
Top Pull Request Labels
Dependencies
- imbalanced-learn *
- jupyterlab *
- matplotlib *
- openpyxl *
- pandas *
- imbalanced-learn
- jupyterlab
- matplotlib
- openpyxl
- pandas
- python 3.*
Score: 4.02535169073515