设为首页 收藏本站
查看: 763|回复: 0

[经验分享] Hadoop源码之JobTracker

[复制链接]

尚未签到

发表于 2016-12-10 06:06:45 | 显示全部楼层 |阅读模式
  转自:http://blog.csdn.net/wwtang9527/article/details/8330472
  JobTracker是Map/Reducer中任务调度的服务器。
  1、有如下线程为其服务:
  1)提供两组RPC服务(InterTrackerProtocol、JobSubmissionProtocol)的1个Listener线程与默认10个Handler线程;
  2)提供任务执行情况查询的一组web服务线程,包括Socker Listener等;
  3)ExpireTrackers:用来停止已经无效的TaskTracker服务;

[java]
view plaincopyprint?






  • synchronized(taskTrackers){

  • synchronized(trackerExpiryQueue){

  • longnow=System.currentTimeMillis();
  • TaskTrackerStatusleastRecent=null;

  • while((trackerExpiryQueue.size()>0)&&
  • ((leastRecent=(TaskTrackerStatus)trackerExpiryQueue.first())!=null)&&
  • (now-leastRecent.getLastSeen()>TASKTRACKER_EXPIRY_INTERVAL)){

  • //Removeprofilefromheadofqueue

  • trackerExpiryQueue.remove(leastRecent);
  • StringtrackerName=leastRecent.getTrackerName();

  • //Figureoutiflast-seentimeshouldbeupdated,oriftrackerisdead

  • TaskTrackerStatusnewProfile=(TaskTrackerStatus)taskTrackers.get(leastRecent.getTrackerName());
  • //ItemsmightleavethetaskTrackersetthroughothermeans;the

  • //statusstoredin'taskTrackers'mightbenull,whichmeansthe

  • //trackerhasalreadybeendestroyed.


  • if(newProfile!=null){

  • if(now-newProfile.getLastSeen()>TASKTRACKER_EXPIRY_INTERVAL){
  • //Removecompletely
  • updateTaskTrackerStatus(trackerName,null);
  • lostTaskTracker(leastRecent.getTrackerName());
  • }else{

  • //Updatetimebyinsertinglatestprofile
  • trackerExpiryQueue.add(newProfile);
  • }
  • }
  • }
  • }
  • }



synchronized (taskTrackers) {
synchronized (trackerExpiryQueue) {
long now = System.currentTimeMillis();
TaskTrackerStatus leastRecent = null;
while ((trackerExpiryQueue.size() > 0) &&
((leastRecent = (TaskTrackerStatus) trackerExpiryQueue.first()) != null) &&
(now - leastRecent.getLastSeen() > TASKTRACKER_EXPIRY_INTERVAL)) {
// Remove profile from head of queue
trackerExpiryQueue.remove(leastRecent);
String trackerName = leastRecent.getTrackerName();
// Figure out if last-seen time should be updated, or if tracker is dead
TaskTrackerStatus newProfile = (TaskTrackerStatus) taskTrackers.get(leastRecent.getTrackerName());
// Items might leave the taskTracker set through other means; the
// status stored in 'taskTrackers' might be null, which means the
// tracker has already been destroyed.
if (newProfile != null) {
if (now - newProfile.getLastSeen() > TASKTRACKER_EXPIRY_INTERVAL) {
// Remove completely
updateTaskTrackerStatus(trackerName, null);
lostTaskTracker(leastRecent.getTrackerName());
} else {
// Update time by inserting latest profile
trackerExpiryQueue.add(newProfile);
}
}
}
}
}
  4)RetireJobs:用来除去已经完成任务的TaskTracker;

[java]
view plaincopyprint?






  • synchronized(jobs){

  • synchronized(jobInitQueue){

  • synchronized(jobsByArrival){

  • for(Iteratorit=jobs.keySet().iterator();it.hasNext();){
  • Stringjobid=(String)it.next();
  • JobInProgressjob=(JobInProgress)jobs.get(jobid);


  • if(job.getStatus().getRunState()!=JobStatus.RUNNING&&
  • job.getStatus().getRunState()!=JobStatus.PREP&&
  • (job.getFinishTime()+RETIRE_JOB_INTERVAL<System.currentTimeMillis())){
  • it.remove();

  • jobInitQueue.remove(job);
  • jobsByArrival.remove(job);
  • }
  • }
  • }
  • }
  • }



synchronized (jobs) {
synchronized (jobInitQueue) {
synchronized (jobsByArrival) {
for (Iterator it = jobs.keySet().iterator(); it.hasNext(); ) {
String jobid = (String) it.next();
JobInProgress job = (JobInProgress) jobs.get(jobid);
if (job.getStatus().getRunState() != JobStatus.RUNNING &&
job.getStatus().getRunState() != JobStatus.PREP &&
(job.getFinishTime() + RETIRE_JOB_INTERVAL < System.currentTimeMillis())) {
it.remove();
jobInitQueue.remove(job);
jobsByArrival.remove(job);
}
}
}
}
}
  

5)JobInitThread:用来对job做一些初始化的工作;

[java]
view plaincopyprint?






  • synchronized(jobInitQueue){

  • if(jobInitQueue.size()>0){
  • job=(JobInProgress)jobInitQueue.elementAt(0);
  • jobInitQueue.remove(job);
  • }else{

  • try{
  • jobInitQueue.wait(JOBINIT_SLEEP_INTERVAL);
  • }catch(InterruptedExceptioniex){
  • }
  • }
  • }

  • try{

  • if(job!=null){
  • job.initTasks();
  • }
  • }catch(Exceptione){
  • LOG.log(Level.WARNING,"jobinitfailed",e);
  • job.kill();
  • }



synchronized (jobInitQueue) {
if (jobInitQueue.size() > 0) {
job = (JobInProgress) jobInitQueue.elementAt(0);
jobInitQueue.remove(job);
} else {
try {
jobInitQueue.wait(JOBINIT_SLEEP_INTERVAL);
} catch (InterruptedException iex) {
}
}
}
try {
if (job != null) {
job.initTasks();
}
} catch (Exception e) {
LOG.log(Level.WARNING, "job init failed", e);
job.kill();
}
  

2、实现了两组rpc服务(协议),其中InterTrackerProtocol如下:
  1)TaskTracker间隔几秒钟发送的心跳服务;

[java]
view plaincopyprint?






  • intemitHeartbeat(TaskTrackerStatusstatus,booleaninitialContact);



int emitHeartbeat(TaskTrackerStatus status, boolean initialContact);
  2)向JobTracker获取新的任务;

[java]
view plaincopyprint?





  • TaskpollForNewTask(StringtrackerName);



Task pollForNewTask(String trackerName);
  3)询问JobTracker,任务是否可以终结;

[java]
view plaincopyprint?





  • StringpollForTaskWithClosedJob(StringtrackerName);



String pollForTaskWithClosedJob(String trackerName);
  4)Reduce Task询问JobTracker,哪些Map Task已经结束;

[java]
view plaincopyprint?





  • MapOutputLocation[]locateMapOutputs(StringtaskId,String[][]mapTasksNeeded);



  MapOutputLocation[] locateMapOutputs(String taskId, String[][] mapTasksNeeded);

  5)获取文件系统名;

[java]
view plaincopyprint?






  • publicStringgetFilesystemName()throwsIOException;



public String getFilesystemName() throws IOException;
  JobSubmissionProtocol如下:
  1)提交一个待执行的job;

[java]
view plaincopyprint?






  • publicJobStatussubmitJob(StringjobFile)throwsIOException;



public JobStatus submitJob(String jobFile) throws IOException;
  2)杀死一个job;

[java]
view plaincopyprint?






  • publicvoidkillJob(Stringjobid);



public void killJob(String jobid);
  3)获取job的名字、id等信息;

[java]
view plaincopyprint?






  • publicJobProfilegetJobProfile(Stringjobid);



public JobProfile getJobProfile(String jobid);
  4)获取job的状态;

[java]
view plaincopyprint?






  • publicJobStatusgetJobStatus(Stringjobid);



public JobStatus getJobStatus(String jobid);
  5)获取Map任务的报告;

[java]
view plaincopyprint?






  • publicTaskReport[]getMapTaskReports(Stringjobid);



public TaskReport[] getMapTaskReports(String jobid);
  6)获取Reduce任务的报告;

[java]
view plaincopyprint?






  • publicTaskReport[]getReduceTaskReports(Stringjobid);

运维网声明 1、欢迎大家加入本站运维交流群:群②:261659950 群⑤:202807635 群⑦870801961 群⑧679858003
2、本站所有主题由该帖子作者发表,该帖子作者与运维网享有帖子相关版权
3、所有作品的著作权均归原作者享有,请您和我们一样尊重他人的著作权等合法权益。如果您对作品感到满意,请购买正版
4、禁止制作、复制、发布和传播具有反动、淫秽、色情、暴力、凶杀等内容的信息,一经发现立即删除。若您因此触犯法律,一切后果自负,我们对此不承担任何责任
5、所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其内容的准确性、可靠性、正当性、安全性、合法性等负责,亦不承担任何法律责任
6、所有作品仅供您个人学习、研究或欣赏,不得用于商业或者其他用途,否则,一切后果均由您自己承担,我们对此不承担任何法律责任
7、如涉及侵犯版权等问题,请您及时通知我们,我们将立即采取措施予以解决
8、联系人Email:admin@iyunv.com 网址:www.yunweiku.com

所有资源均系网友上传或者通过网络收集,我们仅提供一个展示、介绍、观摩学习的平台,我们不对其承担任何法律责任,如涉及侵犯版权等问题,请您及时通知我们,我们将立即处理,联系人Email:kefu@iyunv.com,QQ:1061981298 本贴地址:https://www.yunweiku.com/thread-311959-1-1.html 上篇帖子: hadoop里面的代码片段 下篇帖子: hadoop源码解析copyFromLocal
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

扫码加入运维网微信交流群X

扫码加入运维网微信交流群

扫描二维码加入运维网微信交流群,最新一手资源尽在官方微信交流群!快快加入我们吧...

扫描微信二维码查看详情

客服E-mail:kefu@iyunv.com 客服QQ:1061981298


QQ群⑦:运维网交流群⑦ QQ群⑧:运维网交流群⑧ k8s群:运维网kubernetes交流群


提醒:禁止发布任何违反国家法律、法规的言论与图片等内容;本站内容均来自个人观点与网络等信息,非本站认同之观点.


本站大部分资源是网友从网上搜集分享而来,其版权均归原作者及其网站所有,我们尊重他人的合法权益,如有内容侵犯您的合法权益,请及时与我们联系进行核实删除!



合作伙伴: 青云cloud

快速回复 返回顶部 返回列表