Hacker Read

wolfium3 | karma 194 | avg karma 5.71 · 2020-09-15 07:10:48

I have a dream... that one day people will name their projects with names that don't exist on google yet.

If you search for PEP now you'll find python enhancement proposals, and the "Philippine Entertainment Portal" and the stock code for PepsiCo.

tzs | karma 45790 | avg karma 3.13 · 2020-09-15 12:47:59+00:00

I wish that once people do name their project, they would assign it a 128-bit random number in lower case hex, and include that number on any web page that they would like people searching for their project to find.

That way once I know that say PEP the PDF editor exists and find its 128-bit number (let's say that is 379dd864b16eaca3ce94c15a6bdfcc73), at least I can subsequently toss a +379dd864b16eaca3ce94c15a6bdfcc73 on my searches to effectively let the search engine know I want PEP the PDF editor results rather than PEP the python enhancement results or PEP the entertainment portal results or PEP that refreshing beverage company stock symbol.
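
Minting such a number takes a line or two in most languages. A minimal Python sketch, where the helper names are made up for illustration and nothing here is an agreed convention:

```python
import secrets
import uuid


def new_project_id() -> str:
    """Return 128 random bits as 32 lower-case hex characters."""
    return secrets.token_hex(16)


def uuid_project_id() -> str:
    """Return a version 4 UUID as 32 lower-case hex characters, dashes removed."""
    return uuid.uuid4().hex


if __name__ == "__main__":
    print(new_project_id())
    print(uuid_project_id())
```

Either value can be pasted into a page as-is; both are already lower case and free of punctuation.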

"xxd -l 16 -p /dev/urandom" is a handy way to get a 128-bit random hex number. A UUID generator works, too, although they usually include some punctuation you will need to delete and you might have to lower case their output.

MaxBarraclough | karma 10788 | avg karma 2.11 · 2020-09-15 13:01:12+00:00

> I wish that once people do name their project, they would assign it a 128-bit random number in lower case hex

We already have something similar: URLs.

wtetzner | karma 4883 | avg karma 1.99 · 2020-09-15 13:25:04+00:00

Except a URL only points to one resource. The idea here is that this identifier would exist on any resource related to PEP (maybe even in URLs).

fiddlerwoaroof | karma 6113 | avg karma 2.37 · 2020-09-15 14:38:45

That’s not a problem: you just add a meta or a link tag that points to “the” URL for your project (maybe og:app-id, or a link with rel="app").

notatoad | karma 23514 | avg karma 4.59 · 2020-09-15 21:13:32+00:00

>Except a URL only points to one resource

Isn't that exactly what this is asking for, though? A URL can by definition only point to one resource. So if you include that URL with every other reference to the project (in the app descriptions, blog posts about it, etc.), then you always know you're talking about the same thing. It makes a lot more sense that any resource related to this PDF editor should include a link to "https://macpep.org" instead of including some random 128-bit string. Any resource related to Python PEPs should include a link to "https://www.python.org/dev/peps/" (which all PEPs do, by virtue of having a URL that's a subdirectory of the PEP index URL).
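
As a rough sketch of how a crawler might test for that, assuming a page counts as "about" a project when any of its links points at or below the project's canonical URL (the class and function names here are hypothetical):

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collect href values from <a> and <link> tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag in ("a", "link"):
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def mentions_project(html: str, canonical_url: str) -> bool:
    """True if any link on the page points at or below canonical_url."""
    collector = HrefCollector()
    collector.feed(html)
    return any(href.startswith(canonical_url) for href in collector.hrefs)


page = '<p>See <a href="https://www.python.org/dev/peps/pep-0008/">PEP 8</a></p>'
print(mentions_project(page, "https://www.python.org/dev/peps/"))
```

A plain prefix match is the simplest thing that could work; it ignores redirects, http versus https, and trailing-slash differences.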

oh_sigh | karma 6383 | avg karma 0.89 · 2020-09-15 13:27:07+00:00

More in line with using "golang" since it is far easier to search for than just "go"

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-15 08:28:16

Yeah but urls and names might need to change due to marketing. A hash would uniquely id the project and let the marketing aspect be dynamic.

Although if the actual project name, authors, and codebase all change, is it even still the same project?

drivingmenuts | karma 2286 | avg karma 1.12 · 2020-09-15 13:57:14

Then maybe what really needs to happen is we burn all marketers in a pyre? So they quit fucking up things that ain’t fucked up?

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-15 15:36:05+00:00

There are a number of professions I'd like to add to that pyre please. ;)

gumby | karma 61183 | avg karma 3.86 · 2020-09-15 14:23:36+00:00

tzs is proposing URNs rather than textual program names. A URL is unnecessarily specific (though I suppose you could anycast URL resolution)

westurner | karma 3375 | avg karma 0.93 · 2020-09-15 23:55:37

> RFC 4122 defines a Uniform Resource Name (URN) namespace for UUIDs. A UUID presented as a URN appears as follows:[1]

> > urn:uuid:123e4567-e89b-12d3-a456-426655440000

https://en.wikipedia.org/wiki/Universally_unique_identifier#...

Version 4 UUIDs have 122 random bits (out of 128 bits total).

In Python:

  >>> import uuid
  >>> _id = uuid.uuid4()
  >>> _id.urn
  'urn:uuid:4c466878-a81b-4f22-a112-c704655fa4ee'

Whether search engines will consider a URL or a URN or a random str without dashes to be one searchable-for token is pretty ironic in terms of extracting relations between resources in a Linked Data hypergraph.

  >>> _id.hex
  '4c466878a81b4f22a112c704655fa4ee'

The relation between a resource and a Thing with a URI/URN/URL can be expressed with https://schema.org/about . In JSON-LD ("JSONLD"):

  {"@context": "https://schema.org",
   "@type": "WebPage",
   "about": {
     "@type": "SoftwareApplication",
     "identifier": "urn:uuid:4c466878-a81b-4f22-a112-c704655fa4ee",
     "url": ["", ""],
     "name": [
       "a schema.org/SoftwareApplication < CreativeWork < Thing",
       {"@value": "a rose by any other name",
        "@language": "en"}]}}

Or with RDFa:

  <body vocab="https://schema.org/" typeof="WebPage">
    <div property="about" typeof="SoftwareApplication">
      <meta property="identifier" content="urn:uuid:4c466878-a81b-4f22-a112-c704655fa4ee"/>
      <a property="url" href=""></a>
      <a property="url" href=""></a>
      <span property="name">a schema.org/SoftwareApplication &lt; CreativeWork &lt; Thing</span>
      <span property="name" lang="en">a rose by any other name</span>
    </div>
  </body>

Or with Microdata:

  <div itemtype="https://schema.org/WebPage" itemscope>
    <link itemprop="http://www.w3.org/ns/rdfa#usesVocabulary" href="https://schema.org/" />
    <div itemprop="about" itemtype="https://schema.org/SoftwareApplication" itemscope>
      <a itemprop="url" href=""></a>
      <a itemprop="url" href=""></a>
      <meta itemprop="identifier" content="urn:uuid:4c466878-a81b-4f22-a112-c704655fa4ee" />
      <meta itemprop="name" content="a schema.org/SoftwareApplication &lt; CreativeWork &lt; Thing"/>
      <meta itemprop="name" content="a rose by any other name" lang="en"/>
    </div>
  </div>
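
Reading the identifier back out of the JSON-LD form is straightforward. A minimal sketch using plain `json` rather than a real JSON-LD processor, so it only handles the simple nested shape shown above; the function names are made up for illustration:

```python
import json
import uuid

JSON_LD = """
{"@context": "https://schema.org",
 "@type": "WebPage",
 "about": {
   "@type": "SoftwareApplication",
   "identifier": "urn:uuid:4c466878-a81b-4f22-a112-c704655fa4ee"}}
"""


def about_identifier(json_ld_text: str):
    """Return the identifier of the page's 'about' entity, or None if absent."""
    doc = json.loads(json_ld_text)
    about = doc.get("about")
    if isinstance(about, dict):
        return about.get("identifier")
    return None


def dashless_token(urn: str) -> str:
    """Turn a urn:uuid:... string into the 32-character hex form."""
    return uuid.UUID(urn).hex


urn = about_identifier(JSON_LD)
print(urn)
print(dashless_token(urn))
```

The dashless form is the one most likely to survive as a single search token.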

suprfsat | karma 447 | avg karma 2.3 · 2020-09-15 13:01:55+00:00

Instead of trolling HN you could contribute to Wikidata.

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-15 08:21:34

8 bit ascii could work.

8^256 is a huge number.

thechao | karma 4578 | avg karma 3.54 · 2020-09-15 13:31:49+00:00

I can’t even this math.

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-15 15:47:37+00:00

I worded it wrong.

colejohnson66 | karma 9319 | avg karma 1.93 · 2020-09-15 09:08:38

A 256 bit number is 2^256 possible combinations. 8^256 is the same as (2^3)^256 (or 2^768)
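
Python's arbitrary-precision integers make this easy to check. A quick sketch (the variable name is only illustrative):

```python
# 8^256 versus 2^256: rewrite 8 as 2^3 and multiply the exponents.
assert 8 ** 256 == (2 ** 3) ** 256 == 2 ** 768

# A 256-bit number has 2^256 possible values, far fewer than 8^256.
bits_256 = 2 ** 256
assert bits_256 < 8 ** 256

# bit_length() reports how many binary digits each value needs.
print(bits_256.bit_length())
print((8 ** 256).bit_length())
```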

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-15 15:39:53+00:00

Er yeah. Derp.

8 character ascii not 8 bit.

It's early.

That's 8 bits ^ 8.

Or 256 ^ 8.

and easily able to be represented searching online with 8 characters.

SiVal | karma 7044 | avg karma 5.51 · 2020-09-15 21:05:11+00:00

Don't get discouraged, but you might still have a bug or two to work out with your new and never-before-tried "256^8 and easily able to be represented searching online with 8 characters" design. For your beta test, here are some of the unique 8-char identifiers you might want to try searching for: `unique `, ` unique `, ` unique`, `Unique `, `u^Hunique`, `un^Hnique`, `uniq^H^H^H^H`, ` . . . .`, `. . . . `, `uniqueESCESC`, `BELBELBELBELBELBELBELBEL`, ...

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-17 23:13:19+00:00

Lol, thanks for your totally non sarcastic assistance. It needs some fleshing out.

nickthemagicman | karma 251 | avg karma 0.08 · 2020-09-19 20:22:52+00:00

If the alternative is a 32-char, 128-bit hex string, I think that's a little excessive to expect people to use, especially when an 8-char ASCII string packs more variation into each character and is way easier to remember.

687c066db3458f7cbd5cc8bd58a65c64.

Vs.

*Xrh6x1!

You just have to eliminate dictionary words.
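
Counting it out as a rough sketch (the 95 figure assumes printable ASCII including the space, and the variable names are only illustrative): each ASCII character carries more variation than a hex digit, but eight of them still cover a smaller space than 32 hex digits.

```python
import math

PRINTABLE_ASCII = 95          # codes 0x20 through 0x7E, space included
ID_LENGTH = 8

printable_ids = PRINTABLE_ASCII ** ID_LENGTH   # typeable 8-character strings
byte_ids = 256 ** ID_LENGTH                    # every 8-byte value, control codes and all
hex_ids = 16 ** 32                             # 32 lower-case hex digits

print(f"8 printable chars: {math.log2(printable_ids):.1f} bits")
print(f"8 arbitrary bytes: {math.log2(byte_ids):.1f} bits")
print(f"32 hex digits:     {math.log2(hex_ids):.1f} bits")
```

Removing dictionary words and hard-to-type characters would shrink the 8-character space a little further.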

Fnoord | karma 8086 | avg karma 1.35 · 2020-09-15 08:27:02

If you want something like that with binaries, Nix(OS) might be for you.

romanoderoma | karma 242 | avg karma 0.47 · 2020-09-15 13:48:23+00:00

That's more or less what Ted Nelson envisioned in Xanadu, and that's why he usually says that modern cut and paste has nothing to do with the real cut and paste, and he considers it “a crime against humanity.”

Even though he's friends with Larry Tesler, the man responsible for our modern use of cut and paste.

Links should lead back to the original source, not point to some random text, with no context, that needs to be indexed.

vorpalhex | karma 16037 | avg karma 3.32 · 2020-09-15 10:21:13

That's actually a pretty solid idea. A meta tag for topics.

Only issue would be handling inevitable "SEO-ified" abusers of it.

kps | karma 7825 | avg karma 2.95 · 2020-09-15 10:32:47

Simple solution: 379dd864b16eaca3ce94c15a6bdfcc73™

mulmen | karma 13520 | avg karma 2.38 · 2020-09-15 17:17:37+00:00

Not sure I follow. Are you saying that by trademarking the fingerprint you can prevent SEO abuse?

The problem with any SEO mitigation is that the 128 bit string is intended for SEO. If you make a cool new thing then I blog about it I want to use your 128 bit string and you want me to use it too! So how do you prevent someone else from putting it on a linkfarm? I don't think trademark helps there.

justaguy88 | karma 393 | avg karma 2.14 · 2020-09-16 00:27:19+00:00

If your page has 100 of these 128-bit strings in it, then it's even clearer that it's a link farm page that can be downranked.
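
A minimal sketch of that heuristic, assuming identifiers appear as bare 32-digit lower-case hex tokens. The function names and the cutoff of 20 are invented for illustration, and the pattern would also pick up unrelated values such as MD5 hashes:

```python
from __future__ import annotations

import re

# A 128-bit identifier in the form proposed upthread: 32 lower-case hex digits.
ID_PATTERN = re.compile(r"\b[0-9a-f]{32}\b")


def distinct_project_ids(page_text: str) -> set[str]:
    """Collect the distinct 32-hex-digit tokens found in a page."""
    return set(ID_PATTERN.findall(page_text))


def looks_like_link_farm(page_text: str, max_ids: int = 20) -> bool:
    """Flag pages carrying more distinct identifiers than max_ids.

    max_ids is an arbitrary illustrative cutoff, not a tuned value.
    """
    return len(distinct_project_ids(page_text)) > max_ids


page = "PEP the PDF editor 379dd864b16eaca3ce94c15a6bdfcc73"
print(looks_like_link_farm(page))
```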

tgbugs | karma 969 | avg karma 4.09 · 2020-09-15 19:01:52+00:00

This solves the namespacing problem and allows creators and consumers to use different names if they want. Searching based on the creator's original name for a project becomes a mess because there will be a very large number of HelloWorld applications out there. Interestingly enough, the google web store sort of already does this. The issue that comes up fairly quickly, though, is how to deal with the relationships between different packaged and published versions of what is nominally the same code base, or even forks/branches of the same code base. That means maintaining a verifiable and discoverable chain for published artifacts without completely confusing users or exposing them to various malicious attacks (change a single byte in the middle of that random string and you have a nice off-by-one attack). Lots of infrastructure would be required to pull this off, but it would be great if it could be built.

kalium-xyz | karma 1083 | avg karma 3.24 · 2020-09-15 15:25:45

I too dream of content addressable web.

justaguy88 | karma 393 | avg karma 2.14 · 2020-09-16 00:29:41+00:00

https://docs.ipfs.io/how-to/websites-on-ipfs/single-page-web...

sbr464 | karma 1576 | avg karma 1.84 · 2020-09-15 20:32:06

That's actually a pretty good idea. Kind of like an official @mention/#hashtag for an exact topic; if it somehow wasn't abused by people, it would definitely improve related search results. Navigating user intent algorithms is getting more difficult.

Does schema.org etc support ids beyond keywords/categories? I guess the id could just be a keyword.

Maybe a public registry where you claim an id for a topic, similar to claiming a yelp page or an ISBN number. Then anyone posting related content includes that id. Popular topics could be grouped. You could generate memorable ids for most known topics/products/etc, and people just utilize them organically, robots could apply them automatically over time also.

It's especially bad for words with many definitions, like "bridge repair", which could mean a dental bridge, a guitar bridge, or a bridge over a lake.

devenblake | karma 1054 | avg karma 4.27 · 2020-09-16 01:43:51+00:00

Just added one to a project of mine[0] (just a YouTube browser using the RSS thing YouTube does). Hope it catches on!

[0] - https://github.com/devenblake/ytfeed.py

gjvnq | karma 236 | avg karma 1.35 · 2020-09-16 11:43:17

I kind of do this in my blog. Each page is assigned a UUID.

dubcanada | karma 4308 | avg karma 3.03 · 2020-09-15 13:04:45+00:00

A name that doesn't exist on google? So what exactly would that be?

It is very obvious that if you google "pdf pep", python enhancement proposals or pepsi are not going to show up.

Naming is hard; I have a dream that people would stop complaining about it. There are names/acronyms for literally everything, so the chance of you finding something unique is very, very small.

HumblyTossed | karma 2989 | avg karma 2.55 · 2020-09-15 13:25:35+00:00

Google > pdf pep

Nope. First page is mostly CDC, WHO, etc. Nothing about python or this project.

threcius | karma 153 | avg karma 5.28 · 2020-09-15 08:28:06

Yep, naming is hard. And I like the name Wine most.

(Wine Is Not an Emulator)

Thank you for understanding. :)

colejohnson66 | karma 9319 | avg karma 1.93 · 2020-09-15 11:16:01

“GNU’s Not Unix”

Fnoord | karma 8086 | avg karma 1.35 · 2020-09-15 13:33:55+00:00

Everybody's Google results are different these days. They put you in a bubble. Hence, saying "if you Google you should see result X or Y" isn't necessarily true.

I would say for acronyms containing 2, 3, 4 letters these are all going to be taken at this point.

What matters is how much the acronyms overlap. Pepsi (food & drink) has nothing to do with PDF editors (tech).

pEp (or p=p) [1] on Android is a nice K-9 fork with material design and GPG support / opportunistic encryption. It's not very well known though.

Worst would've been if there's a PEP directly related to PDF.

[1] https://www.pep.security

threcius | karma 153 | avg karma 5.28 · 2020-09-15 08:44:58

First time seeing a .security website in my life.

kzrdude | karma 11414 | avg karma 2.35 · 2020-09-15 08:10:45

We need to find an actionable suggestion. Maybe projects can have a long name and a nickname? To make everyone happy, and of course, to be useful for everyone.

Avamander | karma 4204 | avg karma 1.56 · 2020-09-15 23:38:13+00:00

pePDF? Seems relatively unused compared to pep.

crazygringo | karma 69969 | avg karma 7.48 · 2020-09-15 08:45:19

Unfortunately that will never happen.

The good news is google is smart, and if you add a couple of subject keywords it pretty much always works.

For example if you search "pep pdf editor" the site shows up in first place.

My only issue is naming things after words that are so incredibly common they're on practically every page anyways, and thus truly useless for searching. I'm looking at you, Go.

threcius | karma 153 | avg karma 5.28 · 2020-09-15 13:53:10+00:00

I did not expect that googling "pep pdf editor" would show my site in first place, because I only published the link to my site a few days ago.

threcius | karma 153 | avg karma 5.28 · 2020-09-15 13:54:04+00:00

And google is smart, as you said.

tyfon | karma 3825 | avg karma 3.63 · 2020-09-15 14:30:47+00:00

PEP is also "Politically exposed person" in anti-money-laundering circles, and often the lists we get are PDFs.

Seems like PEP is used for more things too :)

5mk | karma 23 | avg karma 3.29 · 2020-09-15 09:44:55

I study biochemistry, so “phosphoenolpyruvate” was the first thing to come to mind, ha.

narwally | karma 330 | avg karma 1.53 · 2020-09-15 16:20:29+00:00

Also Python Enhancement Proposal

yjftsjthsd-h | karma 28510 | avg karma 2.78 · 2020-09-15 20:03:04

that was already pointed out upthread from you

yitchelle | karma 6998 | avg karma 2.83 · 2020-09-15 10:33:44

In previous times, I worked for a company that had the acronym AAPL. I kept getting the stock quotes for Apple every time I browsed the company's intranet.

Avamander | karma 4204 | avg karma 1.56 · 2020-09-15 23:38:50+00:00

I once had to google something about the Thread protocol. Impossible. I think I haven't seen a worse name for a tech project.

davidandgoliath | karma 542 | avg karma 1.35 · 2020-09-15 11:16:26

Keep dreaming. :)

d4mi3n | karma 1926 | avg karma 3.61 · 2020-09-15 19:19:08+00:00

I usually solve this by giving the big G (or the big Duck) more info to work with in the query: https://duckduckgo.com/?q=pep+pdf+mac

Semiapies | karma 5867 | avg karma 1.94 · 2020-09-15 15:45:51

I have a dream that dang will automate flagging and removing the inevitable inane comments about name uniqueness every time someone posts a project on HN.

Avamander | karma 4204 | avg karma 1.56 · 2020-09-15 23:36:56+00:00

Once you end up investigating an issue with something that namesquats another thing you'll understand.

Semiapies | karma 5867 | avg karma 1.94 · 2020-09-16 00:37:04+00:00

Yes, having only been doing this twenty years, I'm unlikely to be versed in doing that.

Johnny87 | karma 10 | avg karma 0.91 · 2020-09-15 22:31:38+00:00

It's like these companies can't hire marketing reps to establish a brand image that's new