Distributed TensorFlow Test Code Example

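Dataset: MNIST (I read it from local disk)

Dataset link: https://pan.baidu.com/s/1o2faz60YLaba3q7hn_JWqg (extraction code: yv3y)

Put the code and the dataset in the same folder; by default the script reads the data from ./mnist (the data_dir flag).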

Purpose: check whether CUDA and cuDNN were installed successfully on the server

Environment: Ubuntu 16.04, Python 3.6, tensorflow-gpu 1.10, CUDA 9.0, cuDNN 7.4
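Before launching the full distributed job, a single-process check can already show whether TensorFlow sees a GPU. The sketch below is not part of the original script. It assumes the `tf.test.is_gpu_available()` call from TensorFlow 1.x, and the helper name `gpu_status` is made up for illustration.

```python
def gpu_status():
    """Return 'no-tensorflow', 'gpu' or 'cpu-only' (hypothetical helper)."""
    try:
        import tensorflow as tf
    except ImportError:
        # TensorFlow itself is missing, so nothing can be said about CUDA yet.
        return "no-tensorflow"
    # True only if TensorFlow can actually initialise a GPU device.
    return "gpu" if tf.test.is_gpu_available() else "cpu-only"


print(gpu_status())
```

If this reports `cpu-only` on a machine that has a GPU, the CUDA or cuDNN installation is the first thing to recheck before trying the distributed script.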

import math
import os
import time

import tensorflow as tf
from tensorflow.examples.tutorials.mnist import input_data

flags = tf.app.flags
flags.DEFINE_string("data_dir", r"./mnist", "the directory of mnist_data")
flags.DEFINE_integer("train_step", 1000, "the step of train")
flags.DEFINE_integer("batch_size", 128, "the number of batch")
flags.DEFINE_integer("image_size", 28, "the size of image")
flags.DEFINE_integer("hid_num", 100, "the size of hid layer")
flags.DEFINE_float("learning_rate", 0.01, "the learning rate")
# flags.DEFINE_string("checkpoint_dir", r"./temp/checkpoint", "the directory of checkpoint")
# flags.DEFINE_string("log_dir", r"./temp/log", "the directory of log")
flags.DEFINE_string("summary_dir", r"./temp/summary", "the directory of summary")
flags.DEFINE_integer("task_index", 0, "the index of task")
flags.DEFINE_string("job_name", "ps", "ps or worker")
flags.DEFINE_string("ps_host", "localhost:22333", "the ip and port in ps host")
flags.DEFINE_string("worker_host", "localhost:21333", "the ip and port in worker host")
flags.DEFINE_string("cuda", "", "specify gpu")
FLAGS = flags.FLAGS

# Restrict the GPUs this process can see, if requested on the command line.
if FLAGS.cuda:
    os.environ["CUDA_VISIBLE_DEVICES"] = FLAGS.cuda

mnist = input_data.read_data_sets(FLAGS.data_dir, one_hot=True)


def main(_):
    # train_step_list=[50]
    ps_spc = FLAGS.ps_host.split(",")
    worker_spc = FLAGS.worker_host.split(",")
    cluster = tf.train.ClusterSpec({"ps": ps_spc, "worker": worker_spc})
    server = tf.train.Server(cluster, job_name=FLAGS.job_name, task_index=FLAGS.task_index)

    # A parameter server only serves variables; it blocks here forever.
    if FLAGS.job_name == "ps":
        server.join()

    is_chief = (FLAGS.task_index == 0)

    # Variables are placed on the ps job, the remaining ops on the worker.
    with tf.device(tf.train.replica_device_setter(cluster=cluster)):
        start = time.time()
        global_step = tf.Variable(0, name="global_step", trainable=False)

        # One hidden layer followed by a 10-way softmax layer.
        hid_w = tf.Variable(
            tf.truncated_normal(shape=[FLAGS.image_size * FLAGS.image_size, FLAGS.hid_num],
                                stddev=1.0 / FLAGS.image_size),
            name="hid_w")
        hid_b = tf.Variable(tf.zeros(shape=[FLAGS.hid_num]), name="hid_b")
        sm_w = tf.Variable(
            tf.truncated_normal(shape=[FLAGS.hid_num, 10],
                                stddev=1.0 / math.sqrt(FLAGS.hid_num)),
            name="sm_w")
        sm_b = tf.Variable(tf.zeros(shape=[10]), name="sm_b")

        x = tf.placeholder(tf.float32, [None, FLAGS.image_size * FLAGS.image_size])
        y_ = tf.placeholder(tf.float32, [None, 10])

        hid_lay = tf.nn.xw_plus_b(x, hid_w, hid_b)
        hid_act = tf.nn.relu(hid_lay)
        y = tf.nn.softmax(tf.nn.xw_plus_b(hid_act, sm_w, sm_b))

        # Clip the predictions so that log() never receives 0.
        cross_entropy = -tf.reduce_mean(y_ * tf.log(tf.clip_by_value(y, 1e-4, 1.0)))
        train_op = tf.train.GradientDescentOptimizer(FLAGS.learning_rate).minimize(
            cross_entropy, global_step=global_step)

    # last_step=500
    hooks = [tf.train.StopAtStepHook(last_step=FLAGS.train_step)]
    #          tf.train.CheckpointSaverHook(checkpoint_dir=FLAGS.checkpoint_dir,
    #                                       save_steps=1000)]

    # gpu_options = tf.GPUOptions(per_process_gpu_memory_fraction=0.7)
    # sess_config = tf.ConfigProto(gpu_options=gpu_options, log_device_placement=False,
    #                              allow_soft_placement=True)
    # sess_config.gpu_options.allow_growth = True
    sess_config = tf.ConfigProto(log_device_placement=False)

    with tf.train.MonitoredTrainingSession(master=server.target,
                                           is_chief=is_chief,
                                           # checkpoint_dir=FLAGS.checkpoint_dir,
                                           hooks=hooks,
                                           config=sess_config) as mon_sess:
        step = 0
        while True:
            step += 1
            batch_x, batch_y = mnist.train.next_batch(FLAGS.batch_size)
            train_feed = {x: batch_x, y_: batch_y}
            _, loss_v, g_step = mon_sess.run([train_op, cross_entropy, global_step],
                                             feed_dict=train_feed)
            print("step: %d, cross_entropy: %f, global_step:%d" % (step, loss_v, g_step))
            if mon_sess.should_stop():
                end = time.time()
                # print("step_size=", last_step)
                print("time costing:", end - start)
                break


if __name__ == "__main__":
    tf.app.run()

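The code runs one ps and one worker. The ps_host and worker_host flags both default to a local address (localhost); if you need multi-machine distribution, change them yourself.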
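To make the multi-machine change concrete, here is a small sketch of how the two flags turn into the dictionary that `main()` passes to `tf.train.ClusterSpec`. The helper name `build_cluster_dict` and the IP addresses are hypothetical; only the comma splitting mirrors the script.

```python
def build_cluster_dict(ps_host, worker_host):
    """Split comma-separated "host:port" strings the same way main() does."""
    return {
        "ps": ps_host.split(","),
        "worker": worker_host.split(","),
    }


# Hypothetical addresses: one ps machine and two worker machines.
cluster = build_cluster_dict("192.168.1.10:22333",
                             "192.168.1.11:21333,192.168.1.12:21333")
print(cluster)
```

Every process would be started with the same `--ps_host` and `--worker_host` values and its own `--job_name` and `--task_index`, where the index selects that process's entry in the corresponding list.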

If a grpc error is reported at run time, kill the python processes.
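The article does not say which grpc error is meant. One plausible cause (an assumption here) is that a python process from an earlier run still holds the ps or worker port. The sketch below checks whether the script's two default ports can be bound; `port_is_free` is a hypothetical helper and uses only the standard library.

```python
import socket


def port_is_free(host, port):
    """Return True if a TCP socket can currently be bound to (host, port)."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        try:
            s.bind((host, port))
        except OSError:
            # Typically "address already in use": some process holds the port.
            return False
    return True


# Default ports taken from the ps_host and worker_host flags.
for name, port in [("ps", 22333), ("worker", 21333)]:
    state = "free" if port_is_free("localhost", port) else "in use"
    print("%s port %d is %s" % (name, port, state))
```

If a port shows as in use while nothing should be running, killing the leftover python process releases it.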

Run the code:

python mnist_monite.py --job_name=ps --task_index=0 --cuda=-1

Open another terminal and enter:

python mnist_monite.py --job_name=worker --task_index=0 --cuda=0
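The script copies the `--cuda` value into the `CUDA_VISIBLE_DEVICES` environment variable, so `-1` keeps the ps off the GPU and `0` gives GPU 0 to the worker. The sketch below models how such a value is interpreted. It is a simplified illustration that handles only integer indices (not device UUIDs), and `visible_gpu_ids` is a made-up helper name.

```python
def visible_gpu_ids(value):
    """Interpret a CUDA_VISIBLE_DEVICES string made of integer indices.

    None means the variable is unset, so no restriction applies.
    """
    if value is None:
        return None
    ids = []
    for item in value.split(","):
        item = item.strip()
        if not item.isdigit():
            # An invalid entry such as -1 hides itself and everything after it.
            break
        ids.append(int(item))
    return ids


print(visible_gpu_ids("-1"))  # ps: prints []
print(visible_gpu_ids("0"))   # worker: prints [0]
```

When `--cuda` is left empty, the script does not touch the variable at all, which corresponds to the `None` case here.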

Screenshot of the PS run:

[Figure: PS process output]

Screenshot of the worker run:

[Figure: worker process output]

That's it.

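Disclaimer: This article was reposted or adapted from the web. The original author could not be found, and copyright belongs to the original author. If there is a copyright concern, please contact us to have it removed.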
