Settings

Theme

That company whose name used to contain HTML script tags Ltd

find-and-update.company-information.service.gov.uk

436 points by marinintim 5 years ago · 163 comments

Reader

jsty 5 years ago

Nice to see Bobby Tables is all grown up.

Relevant discussion on the Companies House Developer Forum:

https://forum.aws.chdev.org/t/cross-site-scripting-xss-softw...

  • ben_w 5 years ago
    • tcgv 5 years ago

      I was wondering how much it costs to open a company in the UK to do something like that, and it seems to be really cheap (and quick):

      a) Incorporate directly via Companies House

      The standard registration fee to set up a company is just £12 for the ‘standard’ Companies House web incorporation service, which takes up to 24 hours to turnaround. You can pay via credit card, debit card or PayPal.

      Source: https://www.itcontracting.com/how-much-limited-company-cost/

      • pjc50 5 years ago

        Yes, it is. Do they still require a minimum of two officers?

        (Spam warning: if you form a company at your home address, you will be inundated with paper spam for office equipment. Especially from Dell.)

        • ed_elliott_asc 5 years ago

          I have a ltd company registered at home and I got sent a pen with the company name engraved on it - nice spam :)

          I also had a letter telling me I was the beneficiary of a few million dollars which was less useful :)

        • tim333 5 years ago

          No. You can do it with one person.

      • gerdesj 5 years ago

        It is easy to set up a company in the UK and quite rightly so. If it wasn't then the economy might suffer.

        Issues such as what that company does after incorporation are another matter.

    • entrep 5 years ago

      Not many DBMS which support stacked queries I'll guess.

    • dilly_bar 5 years ago

      Hilarious!

  • ashleyn 5 years ago

    This reminds me of a story I saw on Reddit once. A man worked for a payment processing firm that didn't sanitize their database inputs at all.

    One day, they get a new customer called "Select". Absolutely everything stopped working.

  • Nextgrid 5 years ago

    I'm disappointed that the discussion seems more about debating whether that person acted in good faith or that the law regarding acceptable characters in company names should be changed, as opposed to the bigger concern of why were they not sanitizing company names? Even without intent to insert HTML, characters such as < or > would still break their pages.

bearbin 5 years ago

Relevant context: https://twitter.com/zofrex/status/1319286955314614275

Apparently the name used to be

    \"><SCRIPT SRC=MJT.XSS.HT></SCRIPT> LTD
  • achairapart 5 years ago

    New name is disappointing. The company should at least be renamed to:

        \&quot;&gt;&lt;SCRIPT SRC=MJT.XSS.HT&gt;&lt;/SCRIPT&gt; LTD
  • outsidetheparty 5 years ago

    hitting mjt.xss.ht returns this:

    /* THIS SUBDOMAIN HAS BEEN BANNED FROM THE XSS HUNTER SERVICE.

    WE DO NOT ALLOW ABUSE OF OUR SERVICE, ALL SECURITY TESTING MUST BE AUTHORIZED.

    Please use our contact form if you believe this ban was a mistake: https://xsshunter.com/contact */

  • blepblep 5 years ago

    Will that even work without http:// or at least // in front of the domain name?

    Tried it in chrome and sees it as a file name on the current domain.

    • tim333 5 years ago

      Seems to have a bit. Cut and paste from the guy who set up \"><SCRIPT SRC=MJT.XSS.HT></SCRIPT> LTD

      ...

      >I am in the process of contacting every website that has triggered my script which has a readily available contact for submitting security issues, or a hackerone account or similar. Alas, the sort of websites that have XSS problems rarely list IT security contacts.

    • wahern 5 years ago

      I don't think so. The traditional, canonical regular expression[1] for parsing a URL is

        ^(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?(#(.*))?
      
      See https://tools.ietf.org/html/rfc3986#appendix-B

      The authority section (which contains the host domain) must begin with "//" whether there's a scheme prefix or not. Otherwise it's just part of the path (or query or fragment). IIRC, these semantics are also fixed by HTML such that any attribute like HREF or SRC is parsed as-if using the canonical regex (but after entity substitution and whitespace trimming). Browsers might have implemented this differently many years ago, but I doubt it as it would conflict with being able to use a bare path atom (e.g. foo.html).

      [1] I normally eschew using regular expressions for proper parsing, but for URLs the canonical expression is both adequate and advisable for correctness.

    • bearbin 5 years ago

      It had HTTP originally, twitter just munged it.

  • jtsiskin 5 years ago

    You would think if you go to all the trouble to register a company name, you would at least use a domain you control

habosa 5 years ago

There's a great club in Berlin called "://about blank"

So if you're gonna Google it, don't use the URL bar.

  • DaiPlusPlus 5 years ago

    My favourite social establishment is The Progress Bar.

    • paledot 5 years ago

      Nice spot, but the lines are always so long. Fortunately you always know how long you have to wait.

jjar 5 years ago

Fun fact: It's currently law that you can have the <> and "" characters in your name.

https://www.legislation.gov.uk/uksi/2015/17/schedule/1/made

hliyan 5 years ago

On a related topic, the name of the company I work for starts with a colon. The rest of the name is a common adjective. As you can imagine, it is virtually ungooglable at the moment. Any thoughts on how to get around this?

  • mkl 5 years ago

    Just change the name. It'll be easier and more cost effective in the long run.

    I've started working with some software called STACK recently, and it's almost impossible to find anything by searching (go ahead and try!). If it was a commercial product they would be sunk.

    • tlavoie 5 years ago

      At least if it's Haskell's stack build tool, adding the language name to the query makes it pop right up. Not sure if this helps in your case though.

  • joubert 5 years ago

    I found your company name in your profile - ":different"

    Even if I google ":different" company or without the colon, top results for me is a parfumerie.

    When I google different australia, you're the top result.

  • business007 5 years ago

    better find a nickname, like D's community did, using dlang lol otherwise no one could find anything on language googling

    • danellis 5 years ago

      The Go community had to do the same. Go. A language created by a company that makes a well known search engine.

      • abrowne 5 years ago

        But by staff who has worked on Plan 9, right?

      • aasasd 5 years ago

        Those people made it so that one-letter identifier names and junk like ‘fmt’ and ‘Fprintln(w)’ is again okay. So the unusable name fits the spirit quite well.

      • GauntletWizard 5 years ago

        Sometimes obscurity, as well as tweaking your nose at convention, is a feature.

    • schwartzworld 5 years ago

      how hard would it be to learn BASIC using Google? "Basic Programming" is going to have a lot of irrelevant results.

      • iso1631 5 years ago

        DDG'd

          BASIC programming
        
        all the results were about BASIC, the wiki page to start with, then

        * BASIC Programming : 7 Steps - Instructables

        * Learn More - Just BASIC

        * The History of the BASIC Programming Language

        * Programming in BASIC: the absolute beginner tutorial

        * FreeBASIC Language | Home

        * PureBasic - A powerful BASIC programming language

        * Quite BASIC — fun, learning and nostalgia

        * World of Spectrum - Documentation - ZX Spectrum manual

        Only after all that is a non-basic link

        * Introduction | Programming for Beginners

        • ithkuil 5 years ago

          Searched BASIC programming on Google and if also returned results relevant to the actual programming language

          • mdifrgechd 5 years ago

            BASIC programming ironicly may return better results than if you search for something about a more mainstream current language e.g. python. I often find the first few results are some search engine spam... tutorialspoint or geeksforgeeks etc, when a link to the API would be the logical first result. (Usually the first link to the api is for 3.4 or some random version also)

      • OscarCunningham 5 years ago

        I've often wondered if any metallurgists have tried to run computer simulations of the annealing process. How would you find their research if they had?

        • smartscience 5 years ago

          Actually yes :) At least the optimization crowd don't use the phase 'heat treatment', which helps somewhat. But who I really feel bad for is the recruiters trying to hire a chemist who specialises in the element lead.

          • bigiain 5 years ago

            BRB: going to SEO spam my simulated annealing NPM library homepage with "heat treatment"...

      • identity0 5 years ago

        Try it! Nearly every result is related to BASIC. Search engines have gotten very good at guessing what is and isn't a proper noun over the years.

      • rodw 5 years ago

        I don't know whether they actually do it but it seems really easy to treat "BASIC" as a distinct idiomatic token from "basic" when the searcher bothers to get the casing right.

  • Wowfunhappy 5 years ago

    It really is amazing how big a difference this makes.

    I've started using Apple's Aperture software recently (I'm well aware it's been discontinued). I really like it, but my biggest frustration is that it's difficult to learn how to do new things, because "aperture" is a generic word in photography. I can't search for the name and get results about the software.

    • LeifCarrotson 5 years ago

      One of my favorite mobile games is Antiyoy, by Yiotro (https://github.com/yiotro/Antiyoy) who also created other games like Vodobanka, Achikaps, and Bleentoro.

      The creator mentioned that he picked the names because they were pronounceable, unique, memorable, and searchable. That misses out on meaningfulness and familiarity, but those are expensive - by dropping those requirements, you gain easy SEO, trademarks, domains, etc. A big company knowing they're going to sell millions of copies can spend 5 figures on a domain and 6 figures on SEO, but I don't think it's worth it for most startups.

    • dqv 5 years ago

      >I'm well aware it's been discontinued

      Limiting the dates to indexes before 2016 might help (at least with google). You can usually train google to get you what you want after a few searches. This was initially a problem with the Elixir programming language, but it learned what I actually wanted it started letting me just type in the term elixir without specifying it was a programming language. On other computers not associated with that account, it does revert back to the not-so-useful results.

      e.g.

          apple "aperture" color correction before:2016
      • Wowfunhappy 5 years ago

        > but it learned what I actually wanted it started letting me just type in the term elixir without specifying it was a programming language

        Oh, you know what, this might be largely my own fault. I purposefully use Startpage.com as my search engine in order to avoid getting customized results (while still using Google's index).

        I worry that customized results put me in a filter bubble—but they certainly have their advantages!

    • RandallBrown 5 years ago

      The band Chvrches chose to use the the Roman "u" to spell their name so they'd be easier to search.

      • Freak_NL 5 years ago

        Kind of hard to pronounce though. Like that jewellery brand, seemingly pronounced 'buffelgary' or something.

        • RandallBrown 5 years ago

          Sorta. It's still pronounced "churches", but it's definitely a common joke to pronounce it chivurches.

    • bigiain 5 years ago

      See also "Pages", "Numbers", "Keynote"...

      Apple don't give a fuck.

      • Wowfunhappy 5 years ago

        No lies detected, but because they aren't professional software I don't have to search for stuff as often. And the other "Professional" Apple app I use is Final Cut Pro, which doesn't similarly have this problem.

  • saalweachter 5 years ago

    Back when I worked at a comparison shopping engine, I had a bit of a laugh when I saw that the indexing pipeline was generating error messages because the "clean" function returned empty for some products in the feed from Amazon, because they had names like "++++++".

    It was usually musical albums that liked to have names that made it impossible for fans to find the music.

  • Scoundreller 5 years ago

    Don’t go with !!!

    You’ll never find them on YouTube or google unless you search for their informal name: chk chk chk

    • yaboy 5 years ago

      However, they benefitted greatly in the early ‘00s. If you had them in your Apple Music library, iTunes always put them at the top of your alphabetical music library, keeping them top of mind, ! comes before A. There might have a similar iTunes Store benefit too.

      Terrible Google SEO, great accidental Apple SEO.

    • LeoPanthera 5 years ago

      Google have a hardcoded exception for the band "the the".

      • cpeterso 5 years ago

        SiriusXM truncates a "The" prefix from artist names (so "The Cure" and "The Who" become "Cure" and "Who"). I always wondered how it would display The The. Would it be "The The" (special case), "The" (default removal of "The"), or an empty string "" (in the unlikely case the algorithm recursively removed "The" prefixes)? Eventually they played a The The song and the answer is "The The".

        • rodw 5 years ago

          I always liked to imagine filing "The The" under "The, The".

          • bigiain 5 years ago

            Of course the right way of filing them is autobiographically... (I went to see the Infected tour with Louise, so they're filed under "L"...)

    • surround 5 years ago

      There’s a video on YouTube with three full-width explanation points as the title. I watched it once, and although it wasn’t particularly interesting, it bugs me that I cannot find it again.

  • jkingsbery 5 years ago

    Convince the powers-that-be in your company to invest in contracting with a marketing/SEO person or team to help come up with a new name. You want someone with marketing chops so that it's a good name, but you also want someone who knows about SEO so you don't end up on the second page for your own name search.

  • Zelphyr 5 years ago

    Is it "Colon Blow"?

  • achairapart 5 years ago

    Launch a multi-million dollar brand advertising campaign.

  • mtmail 5 years ago

    And I thought Yahoo! (with exclamation mark) had it rough.

    • LeifCarrotson 5 years ago

      At least they're a little better than a hypothetical -"Yahoo" which would return no results for your company at all...

      • tgb 5 years ago

        I was curious what googling only a negative query would do and for this, -"Yahoo" returns just the dictionary definition of the word "yahoo" and no search results.

  • skocznymroczny 5 years ago

    :oscopy ?

razster 5 years ago

I registered <b>Be</b> for a small company a long time ago. I thought it was clever. They folded during the tech bubble.

  • dylan604 5 years ago

    clearly, it was older as now it would be encouraged to use <strong>Be</strong>

  • daveslash 5 years ago

    <blink>Blink</blink> -- It's a deprecated tag now, and most browsers don't support it, but I would have loved to have seen that as a company... <marquee> is still supported though....

  • 7952 5 years ago

    Bold bee?

    • znpy 5 years ago

      Be bold ?

      • surround 5 years ago

        𝗕𝗲

        • azinman2 5 years ago

          I’ve never seen bold in an HN comment before? How do you do it?

          • roddds 5 years ago

            𝐔𝐧𝐢𝐜𝐨𝐝𝐞 𝐡𝐚𝐬 𝐜𝐡𝐚𝐫𝐚𝐜𝐭𝐞𝐫𝐬 𝐭𝐡𝐚𝐭 𝐥𝐨𝐨𝐤 𝐬𝐢𝐦𝐢𝐥𝐚𝐫 𝐭𝐨 𝐟𝐨𝐫𝐦𝐚𝐭𝐭𝐞𝐝 𝐭𝐞𝐱𝐭.

            𝗨𝗻𝗶𝗰𝗼𝗱𝗲 𝗵𝗮𝘀 𝗰𝗵𝗮𝗿𝗮𝗰𝘁𝗲𝗿𝘀 𝘁𝗵𝗮𝘁 𝗹𝗼𝗼𝗸 𝘀𝗶𝗺𝗶𝗹𝗮𝗿 𝘁𝗼 𝗳𝗼𝗿𝗺𝗮𝘁𝘁𝗲𝗱 𝘁𝗲𝘅𝘁.

            𝑈𝑛𝑖𝑐𝑜𝑑𝑒 ℎ𝑎𝑠 𝑐ℎ𝑎𝑟𝑎𝑐𝑡𝑒𝑟𝑠 𝑡ℎ𝑎𝑡 𝑙𝑜𝑜𝑘 𝑠𝑖𝑚𝑖𝑙𝑎𝑟 𝑡𝑜 𝑓𝑜𝑟𝑚𝑎𝑡𝑡𝑒𝑑 𝑡𝑒𝑥𝑡.

            𝘜𝘯𝘪𝘤𝘰𝘥𝘦 𝘩𝘢𝘴 𝘤𝘩𝘢𝘳𝘢𝘤𝘵𝘦𝘳𝘴 𝘵𝘩𝘢𝘵 𝘭𝘰𝘰𝘬 𝘴𝘪𝘮𝘪𝘭𝘢𝘳 𝘵𝘰 𝘧𝘰𝘳𝘮𝘢𝘵𝘵𝘦𝘥 𝘵𝘦𝘹𝘵.

            𝑼𝒏𝒊𝒄𝒐𝒅𝒆 𝒉𝒂𝒔 𝒄𝒉𝒂𝒓𝒂𝒄𝒕𝒆𝒓𝒔 𝒕𝒉𝒂𝒕 𝒍𝒐𝒐𝒌 𝒔𝒊𝒎𝒊𝒍𝒂𝒓 𝒕𝒐 𝒇𝒐𝒓𝒎𝒂𝒕𝒕𝒆𝒅 𝒕𝒆𝒙𝒕.

            𝙐𝙣𝙞𝙘𝙤𝙙𝙚 𝙝𝙖𝙨 𝙘𝙝𝙖𝙧𝙖𝙘𝙩𝙚𝙧𝙨 𝙩𝙝𝙖𝙩 𝙡𝙤𝙤𝙠 𝙨𝙞𝙢𝙞𝙡𝙖𝙧 𝙩𝙤 𝙛𝙤𝙧𝙢𝙖𝙩𝙩𝙚𝙙 𝙩𝙚𝙭𝙩.

            (Don't do this: it's _terrible_ for accessibility, as screen readers can't parse these as regular text)

            • kortex 5 years ago

              Can confirm. All I see are [X]'s.

              𝐔𝐧𝐢𝐜𝐨𝐝𝐞 𝐡𝐚𝐬 𝐜𝐡𝐚𝐫𝐚𝐜𝐭𝐞𝐫𝐬 𝐭𝐡𝐚𝐭 𝐥𝐨𝐨𝐤 𝐬𝐢𝐦𝐢𝐥𝐚𝐫 𝐭𝐨 𝐟𝐨𝐫𝐦𝐚𝐭𝐭𝐞𝐝 𝐭𝐞𝐱𝐭.

              I'm curious if that copied the text or the placeholders. It's like hunter2 for the modern era.

              • TimWolla 5 years ago

                > I'm curious if that copied the text or the placeholders.

                It copied the text.

            • gwittel 5 years ago

              FWIW for most of these sorts of things you can scrub it via passing the text through an NFKC or NFKD transform. I'd hope that a screen reader can be updated to handle this case.

          • nereye 5 years ago
          • mkl 5 years ago

            These are Unicode characters intended for use in mathematical formulas, not text, so they break all sorts of things. It might make some sense to use them in mathematical Python code (where they do seem to work), but they're hard to type.

          • wccrawford 5 years ago

            Looks like Unicode characters to me.

      • Wistar 5 years ago

        Be Best

ryukafalz 5 years ago

I giggled at the previous company name being redacted as “[NAME AVAILABLE ON REQUEST FROM COMPANIES HOUSE]” too. Not sure if that’s a common thing to do or if they made an exception for these shenanigans so they didn’t have to display the XSS.

  • addaon 5 years ago

    Now someone just has to register "[NAME AVAILABLE ON REQUEST FROM COMPANIES HOUSE]" as a company name...

    • dawnerd 5 years ago

      Kinda tempted to see if Oregon would let me register [REDACTED], LLC

      Only 100 dollars...

      Edit: Someone beat me to it. Reg #1330411-94

_4gzn 5 years ago

Here is to the company

  Dariusz Jakubowski x'; DROP TABLE users; SELECT '1
that ran in Poland from 2014 to 2019 [0].

[0] https://prod.ceidg.gov.pl/CEIDG/ceidg.public.ui/SearchDetail... (check the reCAPTCHA and click "Dalej")

pbhjpbhj 5 years ago

For a minute I thought their name was (without the quotes) "[NAME AVAILABLE ON REQUEST FROM COMPANIES HOUSE]".

Seems like a regulation to add "computer code like expressions" to the list that requires prior approval of the Secretary of State might be useful.

  • r-w 5 years ago

    That might require a bit more effort on the part of a company I know, who would have to register their logo "moz://a".

    • pbhjpbhj 5 years ago

      That would be a trade mark rather than a company name in this instance I think. In UK registered trademarks are standard type-written letters for word marks. If they contain symbols then they're figurative marks and it's an image of the mark which is registered.

      On that point though, searching on the UK trademark registry it looks like it just strips non-alphanumeric symbols. A search for "Moz://a” returns "moza".

yobert 5 years ago

And the officer's last name is Tandy. Good computer name :D

proactivesvcs 5 years ago

I'm slightly surprised that AAISP's RevK isn't behind this one. I'd document his shenanigans but they'd eventually break out of the data set.

MonadIsPronad 5 years ago

Now _that_ is pretty funny. I wonder what the company name is. Even the PDF of the incorporation certificate doesn't show the name.

gpvos 5 years ago

Just never change the name to "null".

mathieuh 5 years ago

Is Companies House's website not done by GDS or something? I worked on a few GDS projects for DFT, we had to have independent pen testers test our services before they moved between phases.

tshanmu 5 years ago

Would be it interesting to know why the name had to be changed?

mocatta 5 years ago

Relatedly, several years ago I scraped all companies on the old companies house webcheck site. There were two that interrupted my scraper: both contained '<' in the company name, and both seemed to take the webcheck service offline for a few seconds whenever I requested their pages. I can't say for sure - it might have been a temporary IP block I suppose - but it amused me nonetheless.

kalium-xyz 5 years ago

This is not a prank, its the new Companies house search which still contains some kinks and I assume they had to work around a very specific one here.

wodenokoto 5 years ago

What was the name? It’s former name is listed as “[NAME AVAILABLE ON REQUEST FROM COMPANIES HOUSE]” and it’s current name doesn’t contain any HTML tags (it’s literally the same as the headline)

Those are both funny and confusing names, but they don’t warrant comparison to sql-injections, so I am guessing there’s another name with actual HTML tags.

agys 5 years ago

There was also this project of Mediengruppe Bitnik (check the video with some online bookstores):

http://p-dpa.net/work/script-alert-mediengruppe-bitnik/

daniaal 5 years ago

This company is class. https://find-and-update.company-information.service.gov.uk/c...

ineedasername 5 years ago

Like little Bobby Drop Tables: https://xkcd.com/327/

tomcat27 5 years ago

hahahahaahaha! is this some prank?

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection