MARL-based policy optimization of wind farm control task

This repo contains the code for the paper "Reinforcement Learning-based Control of Wind Farm Composed of Hydrostatic Wind Turbines". The paper proposes a novel multi-agent reinforcement learning (MARL) method, Multi-Agent Policy Optimization (MAPO), to control the hydrostatic wind turbines (HWTs) in a wind farm and maximize its power generation. The structure of this repo is:

Dependency

Python3 (including numpy, tensorflow 1.0, etc.)
FASTFarm (including OpenFAST)

Simulator

FASTFarm: simulates the dynamics of a wind farm

However, FASTFarm uses gearbox-based wind turbines, which we replace with hydrostatic wind turbines. Please see the file ./Gearbox2hydrostatic_transmission.md for the detailed modifications.

There are two wind farm cases used in the simulation:

Three HWTs in a wind farm (./fast-farm/Three_Turbines)
Six HWTs in a wind farm (./fast-farm/Six_Turbines)

The simulation case can be switched by changing lines 90-103 in ./train.py

Data process

./WriteData.py writes the weights of the policy network to the input file of OpenFAST.

./ReadData.py transforms the output of OpenFAST into the samples used for training the MARL algorithm.
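As a rough illustration of the second step, the sketch below turns a simulator output table into training samples. It is not the code in ReadData.py. It assumes the output has already been reduced to a whitespace-delimited table with one row of channel names, one row of units, and then numeric rows; the helper names `parse_output_table` and `to_samples`, the sample channels, and the choice of observation and reward columns are all hypothetical.

```python
# Illustrative table only; the real output layout and channels may differ.
SAMPLE = """Time Wind1VelX GenPwr
(s) (m/s) (kW)
0.0 8.0 1500.0
0.1 8.1 1520.0
"""


def parse_output_table(text):
    """Parse a table with a names row, a units row, then numeric rows."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    names = lines[0].split()
    rows = []
    for ln in lines[2:]:  # lines[1] is the units row, which is skipped
        values = [float(tok) for tok in ln.split()]
        if len(values) != len(names):
            raise ValueError("row length does not match the header")
        rows.append(dict(zip(names, values)))
    return rows


def to_samples(rows, obs_keys, reward_key):
    """Build (observation, reward) pairs from the caller's chosen columns."""
    return [([row[k] for k in obs_keys], row[reward_key]) for row in rows]


if __name__ == "__main__":
    rows = parse_output_table(SAMPLE)
    print(to_samples(rows, obs_keys=["Wind1VelX"], reward_key="GenPwr"))
```

A real training sample would also carry the action taken by each agent; it is left out here to keep the sketch short.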

Multi-agent policy optimization

mapo.py contains the proposed MARL algorithm, MAPO.

train.py contains the training procedure for MAPO.

The hyper-parameters of MAPO

Name	Value	Name	Value
Learning rate	1e-4	Clip range $\epsilon$	0.2
Discount factor	0.99	$\lambda$ return	0.95
Activation function	tanh	Layer units	$[64, 64]$
Episodes	200	Batch size	1024
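
To make the roles of the discount factor, $\lambda$, and the clip range concrete, here is a minimal sketch that assumes MAPO follows a PPO-style recipe: generalized advantage estimation followed by a clipped surrogate objective. The hyper-parameter names suggest this, but the README does not confirm it, and the function names `gae_advantages` and `clipped_surrogate` are illustrative rather than taken from mapo.py.

```python
GAMMA = 0.99     # discount factor from the table
LAMBDA = 0.95    # lambda from the table
CLIP_EPS = 0.2   # clip range epsilon from the table


def gae_advantages(rewards, values, last_value, gamma=GAMMA, lam=LAMBDA):
    """Generalized advantage estimation over one trajectory (assumed recipe)."""
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    for t in reversed(range(len(rewards))):
        # One-step TD error, then an exponentially weighted sum of later errors.
        delta = rewards[t] + gamma * next_value - values[t]
        running = delta + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages


def clipped_surrogate(ratio, advantage, eps=CLIP_EPS):
    """PPO-style clipped objective for a single sample (to be maximized)."""
    clipped_ratio = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    adv = gae_advantages(rewards=[1.0, 1.0, 1.0],
                         values=[0.0, 0.0, 0.0],
                         last_value=0.0)
    print(adv)
    print(clipped_surrogate(ratio=1.5, advantage=2.0))
```

Under this assumption, the $\lambda$-return used as the value target is the advantage plus the value estimate at each step, and the clip range keeps the probability ratio between the new and old policy within $[0.8, 1.2]$ whenever moving further would increase the objective.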

The method to run the code:

mkdir policy learning_curves
python3 train.py --save-dir "./policy/" --plots-dir "./learning_curves/"

Run the following command to see the other input parameters of train.py.

python3 train.py --help
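
For readers unfamiliar with this style of command line, the sketch below shows how the two flags used above could be declared with `argparse`. Only the flag names `--save-dir` and `--plots-dir` come from this README; the defaults, help strings, and the helper names `build_parser` and `ensure_dirs` are assumptions, and the actual parser in train.py may define more options.

```python
import argparse
import os


def build_parser():
    # Only the two flag names are from the README; everything else is assumed.
    parser = argparse.ArgumentParser(description="Train MAPO (illustrative sketch)")
    parser.add_argument("--save-dir", type=str, default="./policy/",
                        help="directory for saved policy weights")
    parser.add_argument("--plots-dir", type=str, default="./learning_curves/",
                        help="directory for learning-curve data")
    return parser


def ensure_dirs(args):
    # Same effect as `mkdir policy learning_curves`, but safe to repeat.
    for path in (args.save_dir, args.plots_dir):
        os.makedirs(path, exist_ok=True)
```

With a parser like this, `build_parser().parse_args(["--save-dir", "./policy/"])` yields an object whose `save_dir` attribute is `"./policy/"`, and calling `ensure_dirs` on it creates the output directories if they are missing, which would make the separate `mkdir` step optional.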

Plot

plot.py shows how to plot the figures in the paper from the outputs of train.py. The figures directory contains the figures from the paper.
