Teaching Beginners to Master Programming

A programming tutorial blog for beginners

Resources for policy gradient

layout: post title: Resources for policy gradient subtitle: Resources for policy gradient date: 2020-01-29 13:51:00 author: “xuepro” header-img: “img/home_bg.jpg” ...

A PyTorch implementation of deep Q-learning Network (DQN) for Atari games

Deep Q-learning Network (DQN) can be used to train an agent to play Atari games. We often use a stack of consecutive frames to represent a state of the environment. DQN uses replay memory to store experienc...
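The replay memory mentioned in the excerpt can be sketched as a fixed-size buffer that is sampled uniformly at random. This is a minimal illustration, not the post's actual code: the names `Transition` and `ReplayMemory` and the field layout are assumptions made here.

```python
import random
from collections import deque, namedtuple

# Hypothetical record layout for one step of experience (an assumption).
Transition = namedtuple(
    "Transition", ["state", "action", "reward", "next_state", "done"]
)


class ReplayMemory:
    """Fixed-size buffer; the oldest transitions are dropped when full."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement breaks the correlation
        # between consecutive frames.
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


memory = ReplayMemory(capacity=3)
for t in range(5):
    # Integers stand in for stacked frames to keep the sketch small.
    memory.push(t, 0, 1.0, t + 1, False)

batch = memory.sample(2)
print(len(memory), len(batch))
```

With a capacity of 3 and five pushes, only the three most recent transitions remain in the buffer.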

Install swig and box2d-py on Windows

I encountered an error when I tried to use the lunar_lander environment in gym. The error is: “module ‘gym.envs.box2d’ has no attribute ‘LunarLander’”. I searched the internet and found the ...
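One quick way to narrow down this kind of error is to check whether the `Box2D` module (the module that the box2d-py package installs) can be found at all. The sketch below uses only the standard library. The helper name `box2d_available` is made up for illustration, and the assumption that a missing Box2D module is the cause of the error comes from the post title rather than from the truncated excerpt.

```python
import importlib.util


def box2d_available():
    """Return True if the Box2D module can be located by the import system."""
    return importlib.util.find_spec("Box2D") is not None


if box2d_available():
    print("Box2D is importable; the LunarLander environment should be able to load.")
else:
    # Assumption: installing swig first, then box2d-py, provides the module.
    print("Box2D is missing; try installing swig and then box2d-py.")
```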

Reinforcement learning resources

Resources for reinforcement learning

Awesome Reinforcement Learning: https://github.com/ZhiqiangHo/awesome-reinforcement-learning Courses: http://rail.eecs.berkeley.edu/deeprlcourse/ A brief introduction to reinforcement learning ...

Comparing PyTorch's convolution with my implementation

Comparison of my convolution implementation to PyTorch

The test code for PyTorch conv2d:

    # pytorch
    import torch
    import torch.nn as nn
    x = torch.tensor(range(50), dtype=torch.float)
    x = torch.reshape(x, (1, 2, 5, 5))  # N,C,H,W
    x.requires_grad = True
    print(x.s...
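For reference, a hand-written convolution of the kind being compared might look like the NumPy sketch below. It is not the post's implementation: the function name `conv2d_naive`, the all-ones kernel, and the choice of three output channels are assumptions for illustration. Like PyTorch's conv2d, it computes a cross-correlation (the kernel is not flipped), and the input reuses the (1, 2, 5, 5) tensor from the test code above.

```python
import numpy as np


def conv2d_naive(x, w, b=None, stride=1, padding=0):
    """Naive 2D cross-correlation over an (N, C_in, H, W) input."""
    n, c_in, h, wd = x.shape
    c_out, _, kh, kw = w.shape
    # Zero-pad only the two spatial dimensions.
    xp = np.pad(x, ((0, 0), (0, 0), (padding, padding), (padding, padding)))
    h_out = (h + 2 * padding - kh) // stride + 1
    w_out = (wd + 2 * padding - kw) // stride + 1
    out = np.zeros((n, c_out, h_out, w_out), dtype=x.dtype)
    for i in range(h_out):
        for j in range(w_out):
            # Patch has shape (N, C_in, kh, kw).
            patch = xp[:, :, i * stride:i * stride + kh, j * stride:j * stride + kw]
            # Contract channel and kernel axes, leaving (N, C_out).
            out[:, :, i, j] = np.tensordot(patch, w, axes=([1, 2, 3], [1, 2, 3]))
    if b is not None:
        out += b.reshape(1, -1, 1, 1)
    return out


x = np.arange(50, dtype=np.float64).reshape(1, 2, 5, 5)  # N, C, H, W
w = np.ones((3, 2, 3, 3))  # C_out, C_in, kh, kw (assumed kernel)
y = conv2d_naive(x, w)
print(y.shape)  # (1, 3, 3, 3)
```

To compare against PyTorch, the same `x` and `w` could be passed to `torch.nn.functional.conv2d` and the two results checked with `np.allclose`.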

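How to run your deep learning code with Google Colab

How to use Google Colab for your deep learning

Google Colaboratory (Google Colab for short) is a free, cloud-hosted Jupyter notebook environment provided by Google. Google Colab comes with the commonly used Python packages preinstalled, including the well-known deep learning libraries TensorFlow and PyTorch. In particular, Google Colab provides free GPUs, so you can write and run Python code in the browser (including machine learning and deep learning code...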

Testing the deep learning library in my book

Test the dense layer:

    import numpy as np
    import matplotlib.pyplot as plt
    %matplotlib inline
    from NeuralNetwork import *
    import util
    np.random.seed(1)
    pts = 100
    k = 2
    b = 1
    X = np.random.randn(pts,1)*...
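The author's `NeuralNetwork` and `util` modules are not shown, so the sketch below uses plain NumPy to illustrate what such a dense-layer test might check: a single linear layer trained by gradient descent should recover the slope `k` and intercept `b`. The noise-free target `y = k * X + b`, the learning rate, and the iteration count are assumptions, since the excerpt is cut off before the data is fully defined.

```python
import numpy as np

rng = np.random.default_rng(1)
pts, k, b = 100, 2.0, 1.0
X = rng.standard_normal((pts, 1))
y = k * X + b  # assumed target: a noise-free line

# A dense layer with one input and one output: pred = X @ W + bias.
W = np.zeros((1, 1))
bias = np.zeros(1)
lr = 0.1

for _ in range(500):
    pred = X @ W + bias                 # forward pass
    grad_out = 2.0 * (pred - y) / pts   # gradient of the mean squared error
    grad_W = X.T @ grad_out             # backward pass for the weight
    grad_b = grad_out.sum(axis=0)       # backward pass for the bias
    W -= lr * grad_W
    bias -= lr * grad_b

print(W[0, 0], bias[0])
```

After training, the learned weight and bias should be very close to 2 and 1.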

The implementation of LSTM from scratch (LSTM implementation code)

The equations for LSTM. I deleted the code; it will be posted here after my book on deep learning is published.
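Since the post's own equations are not included in this excerpt, the commonly used formulation of an LSTM cell is given here for reference. The notation is an assumption and may differ from the book's: $x_t$ is the input, $h_t$ the hidden state, $c_t$ the cell state, $\sigma$ the logistic sigmoid, and $\odot$ the element-wise product.

```latex
\begin{aligned}
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f) && \text{(forget gate)} \\
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i) && \text{(input gate)} \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o) && \text{(output gate)} \\
\tilde{c}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c) && \text{(candidate cell state)} \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t \\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
```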

Face-related papers

Face related papers

SC-FEGAN: Face Editing Generative Adversarial Network with User’s Sketch and Color (code); Generating custom photo-realistic faces using AI (code); deepfake (code); Generate human faces with neural n...

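Learn Python in 1 hour: Section 1.4 of "Deep Learning: Principles and Implementation"

Python tutorial: Learning Python in an hour

Learning Python in an hour. Python is a dynamically typed high-level language; "dynamically typed" means that Python can automatically infer the type of a variable from its value. You can use Python's built-in function type() to query the type of a value. For example, type(2) gives int and type(3.14) gives float. Python's print() function can output a series of objects separated by commas ',', for example: p...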
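The two built-ins described above can be tried out with a short script. The variable names are chosen here for illustration only.

```python
# type() reports the type that Python inferred from each value.
values = [2, 3.14, "hello", True]
type_names = [type(v).__name__ for v in values]
print(type_names)  # ['int', 'float', 'str', 'bool']

# print() accepts several comma-separated objects and joins them with spaces.
x = 2
y = 3.14
print("x =", x, "y =", y)  # x = 2 y = 3.14
```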