We’ve upgraded lemmy.world to 0.18.1-rc.1 and rolled back that upgrade because of issues.

(If you had posted anything in those 10 minutes between upgrade and rollback, that post is gone. Sorry!)

The main issue we saw is that users can’t login anymore. Existing sessions still worked, but new logins failed (from macos, ios and android. From linux and windows it worked)

Also new account creation didn’t work.

I’ll create an issue for the devs and retry once it’s fixed.

Edit Contacted the devs, they tell me to try again with lemmy-ui at version 0.18.0. Will try again, brace for some downtime!

Edit 2 So we upgraded again, and it seemed to work nicely! But then it slowed down so much it was unuseable. There were many locks in the database. People reported many JSON errors. Sorry, we won’t be on 0.18.1 any time soon I’m afraid…

  • rrobin@lemmy.world
    link
    fedilink
    arrow-up
    27
    ·
    1 年前

    As any engineer who does ops can tell you - you did the right thing - the solution is always to roll back, never force a roll forward, ever.

    We should totally do pre and post update parties though. Even if the update fails we can have an excuse for drinks and a fun thread.

    • T156@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      1 年前

      Although since we seem to be rolling more than a ship in a storm, I think a proportion of lemmings would end up hospitalised for alcohol poisoning.

  • OsrsNeedsF2P@lemmy.ml
    link
    fedilink
    arrow-up
    23
    ·
    1 年前

    Both Dessalines and Nutomic have been working their butts off to get 0.18.x ready for the Reddit API changes. Huge hopes they can pull through!

    Dessalines:

    Nutomic:

    • imaqtpie@sh.itjust.works
      link
      fedilink
      arrow-up
      13
      arrow-down
      1
      ·
      1 年前

      Yeah they’ve been hard at work all month. But it’s also okay if things aren’t ready in time. Most of the people who matter are already here.

      Maybe we will blow up soon, maybe later, but the quality of content here is sufficient to drive growth regardless of whether or not we get the prophesized huge migrations from reddit

      • rambaroo@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        1 年前

        It’s adequate the way it is now. They should hold off on any major upgrades until after the next batch of new users come in. Lemmy breaking during that period would be far worse then missing some updates and be features.

      • thegreatgarbo@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        1 年前

        Agreed. I’m here to stay with the content from the last couple weeks. 3 day poop embargoes alone is worth it. And recognizing users because the community is still small enough is SO different from they who shall not be named.

    • egeres@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      1 年前

      Oh god, so much coffee… it’s also thrilling to grasp a sense of what’s going on under the hood of such big social networks at a development level (not like I could understand it, but it was very interesting to see twitter’s recommendation algorithm being open-sourced)

    • axzxc1236@lemmy.world
      link
      fedilink
      arrow-up
      17
      ·
      edit-2
      1 年前

      Reasons I can think of:

      1. The official Android client for lemmy, Jerboa, only supports 0.18 and later, unless users download older version from github and sideload manually.

      2. Sorting is broken pre 0.18, new posts keeps flowing in.

      3. Performance improvement by removing web socket from lemmy. (which fixes 2, which is why 1. happens)

        • average650@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          1 年前

          The latest version only supports 0.18 because the backend works differently. Older versions of Jerboa support older backends of lemmy.

        • astanix@lemmy.world
          link
          fedilink
          arrow-up
          2
          ·
          1 年前

          Likely because Jebora is written as a side project of the devs of lemmy so it’s always going to be cutting edge

          • Graphine@lemmy.world
            link
            fedilink
            arrow-up
            2
            ·
            1 年前

            I understand that. I’m not complaining about the quality. I’m just confused from a technical perspective why it doesn’t support rollback or older server versions in the event of…this.

            • Darorad@lemmy.world
              link
              fedilink
              arrow-up
              2
              ·
              1 年前

              Fixing the issues with posts appearing and scrolling the page the backend needed some pretty drastic rewrites. I’m not super familiar, but from what I’ve seen of the code, it would be a decent amount more work to support both versions.

              Would it have been worth it? Yes, but it wasn’t anticipated that devs would stay on 0.0.17 for more than a day or two. With the time it takes for app stores to update, servers would have been updated before lemmy 0.0.45 was updated for the vast majority of users. At most, it would be a day or two instead of a week or two.

      • JdW@lemmy.world
        link
        fedilink
        arrow-up
        4
        arrow-down
        1
        ·
        1 年前

        Jerboa works fine with lemmy world, it just gives a warning and crashes on occasion. Not an issue to use it though.

        • flawedFraction@lemmy.world
          link
          fedilink
          arrow-up
          4
          ·
          1 年前

          For me at least, I’m unable to change the community scope or sorting method. The only thing I can use Jerboa for is “local” and sorted by “active”. It isn’t totally unusable, but I would much rather be able to switch to subscribed and hot.

        • Disk@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          1 年前

          Yes, if you are already logged in. If you are not logged in, or get logged out you will not be able to log back in until the server version is above 0.18

        • inverimus@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          1 年前

          As a new user I tried logging in on Jerboa and it says the server must be on 0.18 and refuses the login.

        • Vanon@lemmy.world
          link
          fedilink
          arrow-up
          2
          ·
          1 年前

          “crashes on occasion”

          Unfortunately, the occasion seems to be at least once every five minutes for me. thisisfine.jpg This is not fine and definitely a major issue. Especially as we get closer to July 1. After this news from lemmy.world, I will have to switch, probably to Connect, until Sync arrives.

        • Guy_Fieris_Hair@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          1 年前

          With the last couple updates jerboa has got to a point that I probably won’t even move to sync when it comes. I’m pretty happy with it. But if Lemmy Is Fun ever happens, I’m out.

        • Guy_Fieris_Hair@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          1 年前

          With the last couple updates jerboa has got to a point that I probably won’t even move to sync when it comes. I’m pretty happy with it. But if Lemmy Is Fun ever happens, I’m out.

        • wheresyourshoe@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          1 年前

          It won’t let me log in, so no, it doesn’t “work fine.” But I have six other apps for Lemmy, so I’m not worried. It’ll all be sorted eventually.

        • Cadenza@lemmy.world
          link
          fedilink
          arrow-up
          0
          ·
          edit-2
          1 年前

          Well you’re a lucky one. Jerboa has been unusable for me and other folks post v0.34 since the first rc release of 0.18. I finally managed to downgrade to 0.34 and login after lots of black magic. It used to crash instantly, even on 0.34.

          That being said, good luck to lemmy.world admins and thanks a lot for the open comms !

      • WhiskyTangoFoxtrot@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        1 年前

        The official Android client for lemmy, Jerboa, only supports 0.18 and later, unless users download older version from github and sideload manually.

        Even that won’t work, because Jerboa initially tries to connect to lemmy.ml which is running 0.18, which older versions of Jerboa aren’t compatible with. The app just crashes instantly without giving you the opportunity to log in to your instance.

    • dragontamer@lemmy.world
      link
      fedilink
      English
      arrow-up
      15
      ·
      edit-2
      1 年前

      Federation is completely borked with .18 servers. Its very difficult for us to interact with https://lemmy.ca in any way: subscribing, upvoting, comments, posting… its all bugged.

      Its maybe not that big a deal because Lemmy.world “has the most users”, so in some regards its https://lemmy.ca’s loss but… we need to restore reliable federation… especially before the July 1st rush IMO.

      The .17 to .18 upgrade is basically a soft-defederation event, because of whatever this bug is between the two versions.

      • FearTheCron@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 年前

        Ah that explains it. Someone posted a cool photo to my community from lemmy.ca but didn’t interact further. Looks like my comment didn’t even show up on their end.

        Anyway, thanks to everyone working on the issue. I know these things aren’t easy.

        • dragontamer@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          1 年前

          I’m subscribed to lemmy.ca and programming.dev, both of which are .18 now. Feels kinda bad losing access to those communities while this issue is getting worked out…

    • Ruud@lemmy.worldOPM
      link
      fedilink
      arrow-up
      13
      ·
      1 年前

      Because of a few things, really anoying bugs and the Jerboa app not working properly with older versions

    • twistedtxb@lemmy.world
      link
      fedilink
      arrow-up
      7
      ·
      1 年前

      I suspect there will be a large influx of new users in two days, and that having Jerboa not working on .world might cause a few issues

    • Joe B@lemmy.world
      link
      fedilink
      arrow-up
      4
      arrow-down
      1
      ·
      1 年前

      Cause if you are on the homepage for a little bit new post start pushing the old post down and it keeps doing it. something about websockets that 18 fixes it. im assuming the admins want to get it out now so people can stop complaining about it. i get it though just wait for the 18 instead of the rc-1 but users are impatient and can’t wait for sh*t!

    • woelkchen@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      1 年前

      Just curious, why are we updating now instead of waiting for the proper 0.18.1 release?

      A Release Candidate is supposed to be past beta testing already and in a state of no major bugs.

  • runswithjedi@lemmy.world
    link
    fedilink
    arrow-up
    14
    ·
    1 年前

    Thanks for all your work!

    Perhaps it would be beneficial to have a temporary test instance like dev.lemmy.world or something where you dedicate a limited amount of processing power to. That way you could test the update live but not affect the main instance.

    I have no idea about the specifics so that may not be doable.

    • Ruud@lemmy.worldOPM
      link
      fedilink
      arrow-up
      4
      ·
      1 年前

      Yeah we have a test instance, but not sure if we could test with this kind of load…

    • Ulu-Mulu-no-die@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      1 年前

      There are already 3 test instances where devs push updates before going live, but:

      • if not many people go there test, they won’t catch all the bugs.
      • in this specific case, admins are trying to push a new release candidate (that is not officially released yet) because some people are stuck with mobile apps that don’t work anymore on the version we have now.
  • Jerti@lemmy.world
    link
    fedilink
    arrow-up
    14
    ·
    1 年前

    UI: 0.18.0 BE: 0.18.1-rc.1 👍

    seems to be working for me, I was already logged on

  • Nintendo@lemmy.world
    link
    fedilink
    arrow-up
    12
    ·
    1 年前

    appreciate the transparency. how are things looking in the back in lemmy.world (server wise)? will we get to a point where it wont require complete rollbacks on the state when a botched update gets rolled out?

    • Dandroid@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      1 年前

      Not having to do a database rollback is a really, really hard problem to solve, and it would almost certainly need to be on the Lemmy developers side, not the server owner’s side. And if I’m them, that’s a low priority issue, and probably not something I even think about until 1.0.

      Basically, they write code that says what to do in the event of a database version change. Usually this only handles upgrade cases, because that’s what happening most of the time. One example of something you might do in a db upgrade is let’s say you had a column where the data type was only numbers, but now you want to allow any alphanumeric character for some reason. You could have a line of code that converts the number to a string.

      Okay, but now you need to go back to the previous version. Okay, your db change code runs, but it’s the old version of the db change code, not some new version that you wrote. You unfortunately didn’t have a crystal ball when you wrote this code and couldn’t predict that you were going to change the data to strings, so you didnt write code to change it from a string to a number.

      This is why most software doesn’t support downgrades unless you wipe first. For example, if you updated your aging MacBook to the latest Mac OS version, then realized it slows down your laptop too much, you can only go back if you first wipe your laptop in the process. So it’s just easier to just take a snapshot before an upgrade and revert to the snapshot if it fails. Some folks will even do “scheduled maintenance” time during the upgrade in which the whole system goes down for a short time so they don’t have to risk losing data that happened after the snapshot.

  • Sterben@lemmy.world
    link
    fedilink
    arrow-up
    12
    arrow-down
    1
    ·
    edit-2
    1 年前

    I am glad I didn’t write my last post during the upgrade process.

    Maybe next time, give a warning, or maintenance notice.

    Thanks for trying though.

  • naneek@lemmy.world
    link
    fedilink
    arrow-up
    11
    arrow-down
    1
    ·
    edit-2
    1 年前

    Thanks for the transparency. Maybe it’s a good idea to have a test instance and some test cases/validation done there before updating the main instance. This is a regular process in any software/tech company/stack.

    Testing should never be done directly on the prod instance.

  • pampoon@lemmy.world
    link
    fedilink
    arrow-up
    9
    ·
    1 年前

    I’ve found from hopping around some other instances that have upgraded to 0.18 that it is still pretty buggy. It does seem to be giving more information about the errors, instead of just failing like in 0.17, but spend any time browsing on those instances and you’re bound to be inundated with JSON and query errors. It also seems to get worse the longer you browse.

    The UI changes are nice, and I do appreciate not having my feed auto-updating constantly, but I think you’d be making the right choice to hold off on upgrading until they can iron 0.18.1 out all the way. I’m not super knowledgeable about TS and Rust, but as a user it seems that switching from WebSocket created/shined a light on Lemmy’s issues with caching in general.

    • cerevant@lemmy.world
      link
      fedilink
      arrow-up
      3
      ·
      1 年前

      but as a user it seems that switching from WebSocket created/shined a light on Lemmy’s issues with caching in general.

      Or just adding actual error messages instead of ignoring them and throwing up a spinning wheel.

  • Shadywack@lemmy.world
    link
    fedilink
    arrow-up
    9
    ·
    1 年前

    Running one of the largest and most vibrant Lemmy instances certainly makes this deployment a great test case for the devs and the system overall. @ruud@lemmy.world this community really appreciates you!