Blog | Page 3 | Mark Ghuneim

14 Jun 2024

MDG Blog Posts Permalink

Synthetic Media – Luma

MDG June 14, 2024

Luma test – 5 clips 5 seconds each from prompts, concatenated and added a 60-Second Audio Clip

11 Jun 2024

MDG Blog Posts Permalink

Mediaeater Digest Vol.30, No. 163

MDG June 11, 2024

States Take Up A.I. Regulation Amid Federal Standstill (nyt) State lawmakers across the country have proposed nearly 400 new laws on A.I. in recent months, according to the lobbying group TechNet. California leads the states with a total of 50 bills proposed, although that number has narrowed as the legislative session proceeds. (ed note- don’t worry – they will get it done like ad-tech, privacy, social media and other key regulations – dripping sarcasm)

Meta says European data is essential for culturally relevant AI (stackdiary) Meta says it wants to “push to develop AI that understands and reflects European cultures, languages, and humor,” which sounds promising on the surface. However, the approach

Deception abilities emerged in large language models (pnas) This study unravels a concerning capability in Large Language Models (LLMs): the ability to understand and induce deception strategies. As LLMs like GPT-4 intertwine with human communication, aligning them with human values becomes paramount. The paper demonstrates LLMs’ potential to create false beliefs in other agents within deception scenarios, highlighting a critical need for ethical considerations in the ongoing development and deployment of such advanced AI systems. (seems impt -ed)

Private Cloud Compute: A new frontier for AI privacy in the cloud (apple.com) Apple Intelligence is the personal intelligence system that brings powerful generative models to iPhone, iPad, and Mac. For advanced features that need to reason over complex data with larger foundation models, we created Private Cloud Compute (PCC), a groundbreaking cloud intelligence system designed specifically for private AI processing. For the first time ever, Private Cloud Compute extends the industry-leading security and privacy of Apple devices into the cloud, making sure that personal user data sent to PCC isn’t accessible to anyone other than the user — not even to Apple.

Apple’s On-Device and Server Foundation Models (machinelearning.apple.com) We protect our users’ privacy with powerful on-device processing and groundbreaking infrastructure like Private Cloud Compute. We do not use our users’ private personal data or user interactions when training our foundation models.

MacOS Sequoia to Allow iCloud Logins in Virtual Machines on ARM Macs (developer.apple.com) If someone moves a VM to a different Mac host and restarts it, the Virtualization framework automatically creates a new identity for the VM using the information from the Secure Enclave of the new Mac host. This identity change requires the person using the VM to reauthenticate to allow iCloud to restart syncing data to the VM.

What Apple’s AI Tells Us: Experimental Models⁴ While Apple is building narrow AI systems that can accurately answer questions about your personal data (“tell me when my mother is landing”), OpenAI wants to build autonomous agents that would complete complex tasks for you (“You know those emails about the new business I want to start, could you figure out what I should do to register it so that it is best for my taxes and do that.”). The first is, as Apple demonstrated, science fact, while the second is science fiction, at least for now.

(this digest in progress)

10 Jun 2024

MDG Blog Posts Permalink

Generative AI Video: Embracing Interoperability and Open-Source Tools (Clapper AI)

MDG June 10, 2024

Pointing out an important AI video opportunity: Open source script to screen AI video editor Clapper AI. This project and format is singular in the market. Studios, media companies, producers and talent all stand to benefit by supporting development while it is in the nascent stage.

It would set a baseline standard thats interoperable and open. It sits atop existing workflows and pipelines This is a framework that scales for everyone, if done correctly with the added benefit of avoiding costly strategic dependencies in a future timeline.

The ability to switch between different vendors and models seamlessly is a game-changer. This flexibility drives competition like Sora against Veo, StabilityAI against ElevenLabs, and Replicate against Fal.ai, allowing users to harness the best features from each. Unlike traditional open-source tools such as Blender and GIMP, which typically compete against a single commercial product, this tool’s interoperability is a significant competitive edge.

As the media industry is forced to reckon with generative AI multi-model tools, the importance of early stage strategic long term thinking is critical. Be smart, local integration of compute and training into natural production workflows, will fundamentally change how content is created and consumed.

For studios, an open-source script-to-screen AI video editor offers efficiencies + customization and strategic independence. Producers would benefit interoperability + streamline workflows, allowing the focus to be on storytelling

The barrier removed for talent so they can prototype their own narratives, better understand set and setting and scene context, dev new story telling skills and act as a catalyst for creative acceleration. Incremental not supplemental.

Interoperable standards can ensure compatibility between different tools and models, enabling seamless workflows and maximizing efficiency, as it should be. (mp3 and audio codecs anyone) It will also foster innovation and adaptation, but perhaps more importantly avoiding dependencies on specific vendors allows for strategic flexibility. Consider this an open call to action for dev.

Links: – still in dev…

Code https://github.com/jbilcke-hf/clapper
App https://clapper.app

Ed note: We are a long, long, long, way from this replacing film making. This is how we don’t leave that behind. The list of AI generated multi-model media thats meaningful can be counted on zero fingers right now, be a part of that change. Fin

07 Jun 2024

MDG Blog Posts Permalink

Mediaeater Digest Vol.30, No. 159

MDG June 7, 2024

YouTube Copyright Transparency Report (youtube) paid out over $70 billion to creators, artists, and media companies in recent years, and it boasts over 100 million Premium and Music subscribers. Its Content ID system, which allows rightsholders to monetize user-generated content, has paid out over $9 billion in ad revenue

Adobe roofies all of their customers (youtube) After they went to the cloud I stopped using their products. -ed

How to Lead an Army of Digital Sleuths in the Age of AI (nyt) Eliot Higgins and his 28,000 forensic foot soldiers at Bellingcat have kept a miraculous nose for truth—and a sharp sense of its limits—in Gaza, Ukraine, and everywhere else atrocities hide online.

Apple to Debut Passwords App in Challenge to 1Password, LastPass (bloomberg) Apple Inc. will introduce a new homegrown app next week called Passwords, aiming to make it easier for customers to log in to websites and software, according to people with knowledge of the matter.

Andrew Ng (Twitter) The effort to protect innovation and open source continues. I believe we’re all better off if anyone can carry out basic AI research and share their innovations. Right now, I’m deeply concerned about California’s proposed law SB-1047. It’s a long, complex bill with many parts that require safety assessments, shutdown capability for models, and so on.

FTC Opens Antitrust Probe of Microsoft AI Deal (WSJ) Commission has sent subpoenas to tech giant and startup, asking whether their partnership evaded required government review

Heymusic.ai – Coherence-Oriented Contrastive Learning of Musical Audio Representations

Secret hand gestures in paintings (pdf) – hand signs in visual art may provide clues about the underlying iconographical symbols. This paper will examine the eventual hidden meanings behind a peculiar hand gesture that has been widely used by several painters

KLING – Chinese new DiT Video AI Generation model 【KLING】 Open access Generate 120s Video with FPS30 1080P, Understand Physics Better, Model Complex Motion Accurately prompt: Traveling by train, viewing all sorts of landscapes through the window.

Marker: Converts PDFs to Markdown using deep learning, supports all languages.

Intro to LLM’s – Slides from talk by Andrej Karpathy

06 Jun 2024

MDG Blog Posts Permalink

Capital

MDG June 6, 2024

The special skill of each individual machine-operator, who has now been deprived of all significance, vanishes as an infinitesimal quantity in the face of the science, the gigantic natural forces, and the mass of the social labour embodied in the system of machinery, which, together with these three forces, constitutes the power of the master’.

Karl Marx, Capital, 1867

31 May 2024

MDG Blog Posts Permalink

Mediaeater Digest Vol.30, No. 151

MDG May 31, 2024

Announcing Showrunner (The Simulation) We believe the future is a mix of game & movie. Simulations powering 1000s of Truman Shows populated by interactive AI characters. Welcome to Sim Francisco & Showrunner! With Showrunner, you’ll be able to WATCH and DIRECT stories from Simulations like Sim Francisco or Neo-Tokyo or Fallen London. Run out of episodes of your favorite AI show? Generate another! Or let a simulation generate it for you based on its understanding of the characters.

I could drag this out infinitely, (Mario Klingemann ) but I think 9:24 is torture enough. New song “On and On”, celebrating @suno_ai_ ‘s 3.5 model’s unrelenting will to keep the music going whatever it takes.

OpenAI finds Russian and Chinese groups used its tech for propaganda campaigns “We’ve seen them generating text at a higher volume and with fewer errors than these operations have traditionally managed,” Nimmo, who previously worked at Meta tracking influence operations, said in a briefing with reporters. Nimmo said it’s possible that other groups may still be using OpenAI’s tools without the company’s knowledge.

AI Integration and Modularization (Stratechery) Google – The first takeaway from this analysis is that Google’s strategy truly is unique: they are, as Nadella noted, the Apple of AI. The bigger question is if this matters: as I noted above, integration has proven to be a sustainable differentiation in (1) the consumer market, where the buyer is the user, and thus values the user experience benefits th

Utopian Realism, a speech by Bruce Sterling (From the Technology Biennial in Turin, Italy, April 02024.) Five completely different self-invented unique toasters. Why? Why not just go buy the toaster at a store? Well — he very much wants to hand-make a toaster. He wants his toaster as a radically different toaster, the one that belongs to him. This is a “utopian device” in the sense of something that seems visionary, farfetched and silly. It’s just not practical, not realistic — but it’s practical and realistic for him. Calder tended to make art out of objects that the world had abandoned. Like the Turinese “Arte Povera” method — find junk, and dress it up, and re-format it. He had a different value system. To him this is is not junk. To him, this is a struggle for understanding

The Tech Baron Seeking to Purge San Francisco of “Blues” Still, his appetite for autocracy is bottomless. Last October, Balaji hosted the first-ever Network State Conference. Garry Tan—the current Y Combinator CEO who’s attempting to spearhead a political takeover of San Francisco—participated in an interview with Balaji and cast the effort as part of the Network State movement. Tan, who made headlines in January after tweeting “die slow motherfuckers” at local progressive politicians, frames his campaign as an experiment in “moderate” politics. But in a podcast interview one month before the conference, Balaji laid out a more disturbing and extreme vision. (you can take the kid out of Long Island but not the LI out of the kid -ed )

-fast links

Curated directory of cybersecurity tools and resources
RAG CLI
1-bit LLMs Could Solve AI’s Energy Demands “Imprecise” language models are smaller, speedier—and nearly as accurate

28 May 2024

MDG Blog Posts Permalink

Mediaeater Digest Vol. 30, No. 149

MDG May 28, 2024

Taxonomy of Human Rights Risks – Connected to Generative AI this document explores human rights risks stemming from the development, deployment, and use of generative AI technology. Establishing such a rights-based taxonomy is crucial for understanding how the United Nations Guiding Principles on Business and Human Rights (UNGPs) should be operationalised in addressing human rights risks connected to generative AI. This taxonomy is concerned with demonstrating how the most significant harms to people related to generative AI are in fact impacts on internationally agreed human rights.

Surveilling the Masses with Wi-Fi-Based Positioning Systems, we show that Apple’s WPS can be abused to create a privacy threat on a global scale. We present an attack that allows an unprivileged attacker to amass a worldwide snapshot of Wi-Fi BSSID geolocations in only a matter of days. Our attack makes few assumptions, merely exploiting the fact that there are relatively few dense regions of allocated MAC address space. Applying this technique over the course of a year, we learned the precise locations of over 2 billion BSSIDs around the world. (Location has become a victim of tech moving unchecked – wtf happend to Carpenter -ed)

Ransomware Group Claims Responsibility for Christie’s Hack At Christie’s, a spokesman said in a statement, “Our investigations determined there was unauthorized access by a third party to parts of Christie’s network.” The spokesman, Edward Lewine, said that the investigations “also determined that the group behind the incident took some limited amount of personal data relating to some of our clients.” He added, “There is no evidence that any financial or transactional records were compromised.” (White glove opsec is not a thing here -ed)

OpenAI has announced that it is training its next generation of AI models, the follow-up to its state-of-the-art GPT-4 software which powers ChatGPT. (what will be notable is what data is in and what data is opted out and what that looks like in product. -ed)

LeCun to Elon Musk on X about his latest AI goals- Join xAI if you can stand a boss who:
– claims that what you are working on will be solved next year (no pressure).
– claims that what you are working on will kill everyone and must be stopped or paused (yay, vacation for 6 months!).
– claims to want a “maximally rigorous pursuit of the truth” but spews crazy-ass conspiracy theories on his own social platform.

Ken Burns, H’24 Keynote Address to Brandeis University’s 2024 Graduates “The checks of conscience are thrown aside and a deformed picture of the soul is revealed. The presumptive Republican nominee is the opioid of all opioids, an easy cure for what some believe is the solution to our myriad pains and problems. When in fact with him, you end up re-enslaved with an even bigger problem, a worse affliction and addiction, ‘a bigger delusion’, James Baldwin would say, the author and finisher of our national existence, our national suicide as Mr. Lincoln prophesies. Do not be seduced by easy equalization. There is nothing equal about this equation. We are at an existential crossroads in our political and civic lives. This is a choice that could not be clearer. (this is a text antidote to Harrison Butker’s BS -ed)

The Promise of Multi-Agent AI Think of multi-agent systems as teams of experts, each contributing unique knowledge and abilities to collectively tackle difficult problems. Tasks are broken down into components and assigned to the agent best equipped to handle them. As each agent processes its part of the task and passes information to the next, the output is progressively refined and improved. Through such specialization, the resulting systems can achieve results that generalist agents struggle to match.

Steven Chu: ‘Wall Street analysts are totally amoral’ on climate FT In a lecture at the UK’s Royal Academy of Engineering, where he is an international fellow, Chu laid out the challenge. We sometimes refer to previous energy transitions — from wood to coal, or from coal to gas. Chu points out that they never really happened. Humans mostly added new sources of energy. Now, in less than three decades, we hope to transition away from oil, gas and coal. Chu doesn’t believe it’s possible, but he argues we must go as fast as we can.

The good news is the f

23 May 2024

MDG Blog Posts Permalink

Mediaeater Digest Vol 30, No. 144

MDG May 23, 2024

Meta AI chief says large language models will not reach human intelligence. (FT) At an event in London on Tuesday, DeepMind’s chief Sir Demis Hassabis said what was missing from language models was “they didn’t understand the spatial context you’re in . . . so that limits their usefulness in the end”.

Cars & Consumer Data: On Unlawful Collection & Use (FTC) Car manufacturers—and all businesses—should take note that the FTC will take action to protect consumers against the illegal collection, use, and disclosure of their personal data. Recent enforcement actions illustrate this point:

Visibility into AI Agents (arxiv) Visibility informs actions to manage risks from the deployment of increasingly agentic systems, but is not by itself sufficient. Even with a comprehensive understanding of agent activities, those harmed by them may not have the power to intervene and reduce risks. To best make use of visibility, future work could investigate increasing public influence over AI development and deployment, developing a wide range of potential policy levers, and implementing infrastructure and practices to prevent or defend against harms

News Corp and OpenAI today announced a multi-year agreement. So far, Axel Springer has secured a $30 million, three-year deal, while the Financial Times landed a $10 million per year agreement. Rupert Murdoch has netted a substantial rate of $50 million per year, but this covers a bundle of at least seven publications. OpenAI is currently offering between $7 million to $10 million per year per publication as the prevailing rate in the industry. For more context, it would take an 100’s of thousands of deals to satisfy what was stolen. Lastly, getting in early is crucial; the last one in line will face the least favorable deal terms. Lastly if you were an IP rights holder suing them…. Also no word on how Rupert might integrate this with his spy craft.

DOJ Ticketmaster (DOJ). Live and direct at 11 AM EDT Also DOJ more music news, Purported Rap Promoter Pleads Guilty to Role in $2.3M Nationwide Fraud Conspiracy. (ouch “purported”).

DOJ and Ticketmaster: A Timeline
– 1976: Ticketmaster is founded.
– 1991: Ticketmaster merges with Front Line Management; DOJ investigates and allows the merger with conditions.
– 2010: Ticketmaster merges with Live Nation to form Live Nation Entertainment; DOJ approves with conditions to foster competition.
– 2019: DOJ launches an investigation into Live Nation Entertainment for potential violations of the 2010 merger conditions.
– 2023: January: Senate Judiciary Committee holds a hearing on competition in the ticketing industry, fueled by controversy surrounding Ticketmaster’s handling of Taylor Swift’s tour tickets.
– May 23 -2024 DOJ files a new lawsuit against Live Nation Entertainment, alleging antitrust violations and anti-competitive practices “It is time to break it up”.

21 May 2024

MDG Blog Posts Permalink

On device

MDG May 21, 2024

When Google announces it can monitor phone conversations for hostile scams, and Microsoft claims it can search everything you’ve ever touched on your computer, the importance of on-device AI becomes clear. Apple is likely to unveil their own on-device AI capabilities at their upcoming developer event, solidifying this trend.

The question is, how much of this AI processing will truly stay on-device? Will learnings, user preferences always remain local? It’s crucial to understand the implications of on-device AI for personal OPESEC.

On-device AI can be more nuanced. Some examples,,,

Model Updates: While the AI model itself runs on your device, updates to that model might be downloaded from external servers. These updates could include new features, improved accuracy, or bug fixes.
Anonymized Data: Some on-device AI systems may collect anonymized data about usage patterns or performance to help developers improve the model. This data is stripped of personally identifiable information before being sent.
User Consent: Certain applications might request your permission to share specific data for specific purposes, even with an on-device AI model. For example, a voice assistant might ask to send your voice commands to a server for better speech recognition.
Hybrid Models: Some AI systems use a combination of on-device and cloud-based processing. In these cases, certain parts of the AI model might run on your device, while others rely on external servers for more computationally intensive tasks.

Whats possible now, (or what should always be local, and if not, ask why?)

Optimist in me says – Most important trend done correctly, solves for latency, keeps works offline, keeps data on-device

Pessimist in me says: Privacy is always the first victim of convenience. Creep happens, and this feature creep will lead to more and more user consent in the guise of utility and convenience likely geeting neither in the short term.

I came across this post from Clem at Hugging Face, it summed it up nicely while pointing to four models to use . “- No cloud, no cost, no data sent to anyone, no problem. Welcome to local AI on Hugging Face! ” Open source FTW