Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Businesses Programming IT

Exploring the Relationships Between Tech Skills (Visualization) 65

Nerval's Lobster writes: Simon Hughes, Dice's Chief Data Scientist, has put together an experimental visualization that explores how tech skills relate to one another. In the visualization, every circle or node represents a particular skill; colors designate communities that coalesce around skills. Try clicking "Java", for example, and notice how many other skills accompany it (a high-degree node, as graph theory would call it). As a popular skill, it appears to be present in many communities: Big Data, Oracle Database, System Administration, Automation/Testing, and (of course) Web and Software Development. You may or may not agree with some relationships, but keep in mind, it was all generated in an automatic way by computer code, untouched by a human. Building it started with Gephi, an open-source network analysis and visualization software package, by importing a pair-wise comma-separated list of skills and their similarity scores (as Simon describes in his article) and running a number of analyses: Force Atlas layout to draw a force-directed graph, Avg. Path Length to calculate the Betweenness Centrality that determines the size of a node, and finally Modularity to detect communities of skills (again, color-coded in the visualization). The graph was then exported as an XML graph file (GEXF) and converted to JSON format with two sets of elements: Nodes and Links. "We would love to hear your feedback and questions," Simon says.
This discussion has been archived. No new comments can be posted.

Exploring the Relationships Between Tech Skills (Visualization)

Comments Filter:
  • I didn't find PHP anywhere.

    Also, is there a way (or browser) to make the diagram larger?

    • by brausch ( 51013 )

      Found the zoom. Still didn't find PHP. Curious.

    • Re:No PHP? (Score:5, Funny)

      by AmazingRuss ( 555076 ) on Thursday July 02, 2015 @01:26AM (#50030751)

      "I didn't find PHP anywhere."

      If only...

    • by Lando ( 9348 )

      Remember "You may or may not agree with some relationships, but keep in mind, it was all generated in an automatic way by computer code, untouched by a human."

      How amazing computer code that has never been touched by a human. Organic software I guess, no possible way for human bias to creep in.

    • by Anonymous Coward

      I didn't find PHP anywhere.

      Well, the article *did* say it was limited to technical skills...

      Also, is there a way (or browser) to make the diagram larger?

      Case in point.

      • What technical skills? Maybe web 2.0, but definitely not TECH. Technical is the hard engineering, CompSci / Physical / Simulations.
      • by pla ( 258480 )
        Technical skills... You mean like how half of the bubbles say fluffy BS like "leadership", "strategy", "demand generation" and "spanish" (yes, the chart includes "spanish" as a node, over near ITIL)? Yeah, sure, "technical" - If you work in HR, maybe.

        "Applicant must recognize a computer, and not attempt to eat the mouse. 2-4 years experience stuffing envelopes preferred, because we don't understand metered trifolds. Ideal candidate can tell Brioni from Armani by smell alone".
  • by GoodNewsJimDotCom ( 2244874 ) on Thursday July 02, 2015 @01:01AM (#50030681)
    This is a nice visualization. If I was young and aimless, I might see some value in learning new techs I don't actually need right now. Learning new techs could help you land jobs in today's age of clueless HR people judging you by your tech list vs your ability.

    However, I'm getting older and I learn another tech only if I actually need to use it for something I'm working on. So I'll be happily deficient in lots of languages I don't need. I'm certain I could be at least of average skill after about two weeks of most languages as that is my past experience with new languages. But don't tell HR. Today's software engineering world is so averse to training people it rarely considers searching for a veteran software engineer and letting him come up to speed on random techs.

    If I spent a couple weeks on every tech I hear about for the sake of toying with it, I'd never get anything done.
    • Today's software engineering world is so averse to training people it rarely considers searching for a veteran software engineer and letting him come up to speed on random techs.

      Not to put too fine a point on it but that's your own responsibility, not the company you work for.

      If there is an aversion to companies training people. that' offset by the ease of learning any newer (or even older) technology, for free.

      If you wait for the company to help you, you (and your career) will ossify. I have seen the res

      • Re: (Score:2, Informative)

        by Anonymous Coward

        Today's software engineering world is so averse to training people it rarely considers searching for a veteran software engineer and letting him come up to speed on random techs.

        Not to put too fine a point on it but that's your own responsibility, not the company you work for.

        Are you kidding?

        Sure, that applies if you're a short-term contractor. But permanent staff ought to be getting a regular training budget from their companies. If you're not, then you're working for a bad employer.

        If there is an aversion to companies training people. that' offset by the ease of learning any newer (or even older) technology, for free.

        It may be free in the financial sense, but it still has a cost in time. A good employer will give you that time. And pay the costs if there are any. You may need to explicitly ask for it, but the bottom line is that they should be investing in their long-term staff.

        If you wait for the company to help you, you (and your career) will ossify. I have seen the result when I was younger, the result is not good for your freedom to choose favorable working conditions.

        My own experience: In the last yea

      • Today's software engineering world is so averse to training people it rarely considers searching for a veteran software engineer and letting him come up to speed on random techs.

        Not to put too fine a point on it but that's your own responsibility, not the company you work for.

        Only if you are a contractor. Otherwise, the answer is no.

  • by Penguinisto ( 415985 ) on Thursday July 02, 2015 @01:02AM (#50030683) Journal

    Find and click on MongoDB. Then notice that "Database" is this tiny-assed little circle waaaaaaaaaaaaaaaaaaay off to one side.

    Got a pretty good laugh out of it. :)

    • it's just analyzing the appearance of words in listed skills

      actual database pros would not put "database" as an enumerated skill

      maybe the kind of person who lists "windows" "internet explorer" and "microsoft word" as tech skills would, but such people would not show up in the data set analyzed here: resumes from serious professionals working in the tech sector

      so it makes sense "database" would only be a tiny little distant circle

      • by ranton ( 36917 )

        it's just analyzing the appearance of words in listed skills

        so it makes sense "database" would only be a tiny little distant circle

        So what you are saying is their chosen implementation for analyzing relationships between skills is incomplete?

        • are you saying there exists some implementation that analyzes every resume in existence perfectly? it's "incomplete" in the sense that any such effort is incomplete and imperfect by nature of the problem. your criticism is invalid, you don't understand the task if you expect completeness is possible

          • by ranton ( 36917 )

            are you saying there exists some implementation that analyzes every resume in existence perfectly? it's "incomplete" in the sense that any such effort is incomplete and imperfect by nature of the problem. your criticism is invalid, you don't understand the task if you expect completeness is possible

            The task is to determine which professional skills relate to each other (this goal comes from TFA, not just the summary). The use of resumes to accomplish this is an implementation decision. Any shortcomings of using resumes is not necessarily an inherent limitation to solving the task at hand. There are other ways to complete the task that do not rely on resumes at all, or that rely on them less.

            For instance, you could use web search results to obtain data for your distributional semantics research. Websit

            • your alternative method is inferior as the specific request is tech *skills*, which you find on resumes, people speaking to their merits to get hired

              not "tech appearing together on message boards," which indicated a whole host of relationships, relationship by skillset being far down the list

              the simple fact is there is no perfect methodology so criticizing the methodology for being imperfect is without merit. and in articulating a yet even more inferior methodology in your latest comment i have to assume yo

    • RDBMS is one of the largest circles though. It is located in the orange area which seems like the traditional, popular, programming skills area. The RDBMS circle doesn't have as many links from it as I thought it would, but that would be how the cutoff is implemented. It's pretty nice.

  • and i've seen my fair share of poor visualizations

    but i think this is actually really well done, useful even

    congratulations to Simon Hughes

    • by Anonymous Coward

      As one who goofs off with some of the Graphviz tools (dot), how possiblewould it be to dowith this toolchain?

    • by dwpro ( 520418 )

      I agree that it is a cool visualization. I do think it's noteworthy to mention that the data was reverse-engineered from resumes, so the relationships all have the bias of what individuals _think_ will matter to hiring committee or want to be hired to do, vs what actual skills they possess.

      • Read the blog post again. http://insights.dice.com/2015/... [dice.com]

        "I think that’s pretty cool, given we’re generating that automatically from job descriptions posted on our site. We also tried using the resume dataset, but the results were of a lower quality, as the skills extracted from resumes can be from different jobs."

        It was extracted from job-postings, which would only identify Schelling points in the hiring industry, not skill clusters common to people with certain desirable skill sets; in othe

        • by dwpro ( 520418 )

          Ah, you're right. Thank you for that correction.

          • Ah, you're right. Thank you for that correction.

            No problemo

            I miss subtle stuff all the time. I rely on really strict semantics in place of "not trusting people", which I have a hard time doing (data is data).

    • by sribe ( 304414 )

      but i think this is actually really well done, useful even

      To the extent that I want to bookmark this, just for reference as to the tools he used ;-

  • Quick! Hide it before they use it to generate H-1B targeting resumes having impossible combo's of skills to filter away citizens.

  • When did the right panel get subsumed by ads? Had this been happening for long and I just hadn't noticed? I checked out the site in chrome to see the site without scripts, and holy smokes has it gotten bad.

    I would not frequent this site if the normal experience was accompanied by a trojan condom video ad (among several others) on loop.

  • by tlambert ( 566799 ) on Thursday July 02, 2015 @04:53AM (#50031245)

    It's not very reliable data.

    They took the similarity vectors from the job postings, not from resumes, so rather than "what you're likely to know", they computed "what an employer is likely to want at the same time as wanting something else", and then declared that a similarity due to an already skewed cosine similarity metric. This happens because employers are more likely to copy other, similar job postings, or other job postings for companies in a similar business as them, or those of a company whose employees they wish to hire away.

    They claimed that they tried using resumes, but that the resulting data was not as "clean"; uh... duh?

    This visualization was not actually very useful, unless you are trying to design a resume to get yourself hired, regardless of your actual current capabilities.

    • It's a polite way of saying "stop putting crap in your resumes". Don't encourage people to do more of it ;-)

      In all seriousness though, have you ever tried to analyse unstructured text? It's hard. How would you realistically improve it? Do you start with a preconceived list of technology key words and count them in the resumes? People misspell words. Words have multiple meanings depending on context. Technologies change and you might miss key words. Or you could count recent Stack Exchange topic tags, but th

      • In all seriousness though, have you ever tried to analyse unstructured text? It's hard. How would you realistically improve it? Do you start with a preconceived list of technology key words and count them in the resumes? People misspell words. Words have multiple meanings depending on context.

        I've been writing code like this since 1985. Then, it was in LISP.

        It's actually trivial to me at this point. You end up with a meaning trie with differential probability vectors, and some of the roots wither away as you go down. Making a machine decision is harder, but not entirely impossible.

        I get incredibly annoyed at people like Lazlo Bock who want to put everyone's resumes into a form that basically allows Google (Lazlo Bock works for Google) or other companies to magically allow you to come into a n

  • C# is just as big as Linux? Whodathunk.

  • Buzzword association (Score:4, Interesting)

    by tomhath ( 637240 ) on Thursday July 02, 2015 @07:16AM (#50031597)
    I had to follow the link to Hughes' report to find how he created the list of inputs:

    by importing a pair-wise comma-separated list of skills and their similarity scores ...

    we’re generating that automatically from job descriptions posted on our site.

    So what this really shows is how often the same two buzzwords appear together in a job description posted on Dice.

    I found another comment in his report interesting:

    We also tried using the resume dataset, but the results were of a lower quality,

    I assume by "lower quality" he really means "people list every buzzword they can think of on the resumes posted on Dice".

    Given the inputs I wouldn't expect any surprises in the results. But that said, it's an interesting project and they did a very nice job with the visualization.

  • by methano ( 519830 ) on Thursday July 02, 2015 @07:39AM (#50031691)
    It's interesting that Apple and its product and technologies are not related to any of these things in any way.
  • Looked for two dots: C and SQL.

    C has a nice big dot, but connects to only a handful of extremely broad-scoped nodes.

    SQL, OTOH, has a tiny dot, and connects to just about everything on the chart.
  • Since when Palo Alto is a tech skill?

    Need to update my CV...

  • // ==UserScript==
    // @name No Nervals Lobster
    // @description Remove troll posts from Slashdot front page.
    // @include http://slashdot.org/*
    // @include http://.slashdot.org/*
    // @include https://slashdot.org/*
    // @include https://.slashdot.org/*
    // @exclude https://.slashdot.org/story/*
    // @exclude http://.slashdot.org/story/*
    // @grant none
    // ==/UserScript==

    var elements = document.getElementsByTagName('article');
    for(var i = 0; i < elements.length; i++) {
    var text = elements[i].textContent;

  • Where are the hardware skills?

    'Tech skills' don't mean much without hardware.

    Strat

I have hardly ever known a mathematician who was capable of reasoning. -- Plato

Working...