September 07, 2010

Linux Weekly News

Cairo 1.10.0 available

The 1.10.0 release of the Cairo graphics library has finally been released. "One of the more interesting departures for cairo for this release is the inclusion of a tracing utility, cairo-trace. cairo-trace generates a human-readable, replayable, compact representation of the sequences of drawing commands made by an application. This can be used to inspecting applications to understand issues and as a means for profiling real-world usage of cairo." The profiling feature has evidently been used to improve performance in a number of areas. There is also improved printing support, better 16-bit buffer support, and better use of hardware acceleration.

by corbet at September 07, 2010 01:12 PM

Linux Today

Five critical apps for Android that you want to find on iOS

BerkeleyLUG: "It seems like nearly every 6 months, Apple redefines just how closed their closed platform can be. Instead of just completely restricting the source and distribution of their own software, they now also limit the kind of third-party applications and content their users can enjoy."

September 07, 2010 12:35 PM

September 2010 issue of The NEW PCLinuxOS Magazine Released

PCLinuxOS Magazine: OpenOffice Impress, LXDE, Quick GIMP Tip, Meemaw, Linux Market Share...

September 07, 2010 12:05 PM

KDE SC 4.5 – Desktop Activities Exposed

everyday linux how2s: "Recently, I wrote a post about the KDE 4.5 SC release candidate. In that post, I was a little bit critical about the discoverability of desktop activities."

September 07, 2010 11:05 AM

Infoworld: Open Sources

What Paul Allen and Larry Ellison have in common

At first sight, this extraordinary legal action against most of the digital world's leading lights might seem one of a kind:

read more

by Open Sources at September 07, 2010 10:00 AM

O'Reilly radar

Four short links: 7 September 2010

  1. GalaxyZoo for Climate Science? -- GalaxyZoo is the crowdsourced physics research. A group of climate scientists want the same, to help predict "weather events". See also the Guardian article. (via adw_tweets on Twitter)
  2. Crispian's Science Map -- gorgeous Underground-style map showing scientists and their contributions. (via arjenlentz on Twitter)
  3. Programming Things I Wish I Knew Earlier (Ted Dziuba) -- opinionated piece, but boils down to "keep it simple until you can't", and "the more you know about the actual hardware, the better you can code". With EC2, when Amazon says "I/O performance: High", what does that even mean? Is that suitable for a heavy random read scenario? (via Hacker News)
  4. The Molecular Biology Carnival, 2ed -- collection of excellent blog writing about molecular biology. (via BioinfoTools on Twitter)

by Nat Torkington at September 07, 2010 10:00 AM

Linux Today

Nightingale - Much Anticipated Songbird Fork Which is Soon Going to Have its Fir

Tech Drive-in: "Songbird is still open source, but developers behind Songbird have decided to discontinue support for Linux and the concentrate on Mac and Windows versions of Songbird."

September 07, 2010 07:05 AM

Review: Kubuntu 10.04 Trinity "Lucid Lynx" (Idea by Candid of Linux Today)

Das U-Blog by Prashanth: "This is how I felt when testing Kubuntu 10.04 Trinity. It's a weird mishmash of old-school and new-school KDE."

September 07, 2010 03:05 AM

September 06, 2010

Linux Today

Reviewed: Linux Mint 9 KDE

Tuxradar: "Ubuntu is dominating the mindshare for desktop distros, but there's still a place for rivals – even those built on the Ubuntu foundation – to take things in different directions."

September 06, 2010 11:04 PM

OStatic blogs

Campsite a Hearty Content Management System for Journalists

Campsite

While the newspaper industry is scrambling to find ways to engage -- and keep -- readers, a team of developers has been quietly creating a free, open source content management system (CMS) to help online news Web sites reach their audience. Campsite is a CMS specifically designed for news organizations big and small, and contains features that are near and dear to a journalist's heart.

Although Campsite is designed for multiple users, articles undergoing editing are locked so writers and editors can't inadvertently trample on each other's work. The WYSIWYG editor supports text formatting, image insertion, mutli-page posts, and more. Articles can be grouped into sections, then sections can be grouped into single issues or editions, much like the layout a traditional newspaper. Campsite also permits scheduled (post-dated) publishing and multiple article formatting templates.

Readers are the lifeblood of any publication and Campsite offers some terrific subscriber management tools. Support for trial subscriptions and and sliding subscriber pay periods is available, as well as the ability to lock down certain sections of the site for pay-only viewing.

On the developer side, Campsite is built on the LAMP development stack and includes an object-oriented API so users can create their own plugins or alternative interfaces. There's a robust developer community surrounding the app, but there are also a team of full-time developers working on the project who will quickly create additional features for a small fee.

To get an idea of what Campsite looks like in action, have a look at the Media Development Loan Fund Web site, or one of these European-based news outlets. The app's full feature list, extensive documentation, downloads, and demos are all available online.

Related Activities

Related Software

Related Blog Posts


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Lisa Hoover at September 06, 2010 08:23 PM

Linux Weekly News

Graesslin: Driver dilemma in KDE workspaces 4.5

Martin Graesslin looks at problems with the interaction between KWin and some graphics drivers. "Now that I have explained all our checks we did to ensure a smooth user experience, I want to explain how it could happen that there are regressions in 4.5. In 4.5 we introduced two new features which require OpenGL Shaders: the blur effect and the lanczos filter. Both are not hard requirements. Blur effect can easily be turned off by disabling the effect and the lanczos filter is controlled by the general effect level settings which is also used for Plasma and Oxygen animations. Both new features check for the required extensions and get only activated iff the driver claims support for it. So everything should be fine, shouldn't it? Apparently not when it comes to the free graphics drivers (please note and remember: we do not see such problems with the proprietary NVIDIA driver!)." (Thanks to Jos Poortvliet)

by ris at September 06, 2010 08:08 PM

Linux Today

Some useful utilities

Ubuntu Musings: "Utilities. They're what help make computing a lot easier and more convenient, no matter what operating system you use."

September 06, 2010 07:04 PM

GigaOM by Om Malik

28266486_2e39669e4d_z

Author Nicholas Carr wrote a controversial post recently about the use of hyperlinks in online content, in which he argued that links were a distraction for readers, and were likely to lead to less comprehension rather than more. This idea was an offshoot of Carr’s latest book, The Shallows, which makes the claim that the Internet — and digital media in general — are making society dumber rather than smarter. Now Scott Rosenberg, one of the founders of the online magazine Salon and of a new media-accuracy startup called MediaBugs, has written an admirable series of posts defending the link as the heart and soul of the web. In his original post, Carr described links as “conveniences,” but said they also functioned as a distraction for readers:

Sometimes, they’re big distractions – we click on a link, then another, then another, and pretty soon we’ve forgotten what we’d started out to do or to read,” he wrote. “Other times, they’re tiny distractions, little textual gnats buzzing around your head.

The author said that research he looked at for his book showed this created a “cognitive load” for readers, and those who read hypertext “comprehend and learn less… than those who read the same material in printed form.” Some prominent writers and media figures agreed with Carr’s take, including — ironically — Laura Miller, a writer and book reviewer with Salon, who argued that links shouldn’t be necessary if writers did their job of synthesizing the topic properly, and said that most people don’t click on links anyway. Carr also got some support from Jason Fry, writing at the Neiman Journalism Lab, and Ryan Chittum in a piece for the Columbia Journalism Review (in the spirit of full disclosure, I wrote about Carr’s argument on my personal blog).

As Rosenberg describes in his first post in response to Carr, much of the research that the author relies on for his attack on hyperlinks and comprehension don’t really fit with his broad thesis. For example, the kinds of links that were studied in the research Carr uses in “The Shallows” had nothing to do with adding context to the text that they were embedded in; in other words, they weren’t the kind of hyperlinks that everyone is used to in blog posts and other Internet content. As Rosenberg notes:

All this study proved was something we already knew: that badly executed hypertext can indeed ruin the process of reading. So, of course, can badly executed narrative structure, or grammar, or punctuation.

Instead of impeding understanding, as Carr and his supporters argue, Rosenberg says he believes that they deepen it, quoting author Steven Johnson as saying that links are a tool for synthesis, “a way of drawing connections between things,” to bring coherence to the vast universe of information online. “The Web’s links don’t make it a vast wasteland or a murky shallows,” Rosenberg says, “they organize and enrich it.” I’m firmly on the Salon founder’s side in this one — without links, what point is there in having hypertext at all? The whole idea behind Tim Berners-Lee’s invention was to enable sites to point to each other and create a “web” of context. Do they impose a cognitive load of some kind on users? Possibly, but in my view, the benefits far outweigh the disadvantages.

In his second post on links, Rosenberg first takes on what he calls “corporate linking,” which is the practice of clogging up text with links “because they provide some tangible business value to the linker: they cookie a user for an affiliate program, or boost a target page’s Google rank, or aim to increase a site’s “stickiness” by getting the reader to click through to another page.” Rosenberg also argues that much of this is Google’s responsibility, because of the value attached to page rank and links:

Google is a great tool because it draws meaning from links. And it is a profitable company because it has placed a tiny but real financial value on many links. But by making links a business, Google also made it harder for editors and writers to defend responsible linking.

In the third post in his series, Rosenberg says that even if Carr is right and links do slow down reading and get in the way of understanding the content they appear in, he would still prefer to have links, because they are “additive and creative.” Links pull together different pieces of a topic and connect them into a whole, he says, and at their best, they also “show a writer’s work” and are “badges of honesty, inviting readers to check that work.” Rosenberg adds that the use of links has multiple benefits, including:

  • Saying hello. “A link to another site can serve as a way of telling that site, ‘I just said something about you.’ This kind of link remains a valid and valuable social gesture.”
  • Showing your work. “Some people are happier with this stuff collected at the end, as we did for centuries in print. But linking in situ gives the reader the information right where it’s needed.”
  • Fairness. “Does a writer present the perspectives of those he disagrees with in a way that they feel is fair? Linking to those perspectives is a way for a writer to say: Go ahead — see if I got you right.”
  • Adding context. “A fragment that gets connected is no longer a fragment. It becomes a working part, a piece of a mosaic, a strand in a web.”

As Rosenberg puts it in the conclusion to his series, writing online without linking “is like making a movie without cutting. Sure, it can be done; there might even be a few situations where it makes sense. But most of the time, it’s just head-scratchingly self-limiting. To choose not to link is to abandon the medium’s most powerful tool — the thing that makes the Web a web.” Hear, hear.

Related content from GigaOM Pro (sub req’d): Why Google Should Fear the Social Web

Post and thumbnail photos courtesy of Flickr users papalars and Robert Brook


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Mathew Ingram at September 06, 2010 07:00 PM

Linux Weekly News

Monday's security updates

Debian has updated smbind (sql injection).

Fedora has updated pam_mount (F13, F12: arbitrary code execution), libhx (F13, F12: arbitrary code execution), F13: python (multiple vulnerabilities), and F12: sblim-sfcb (arbitrary code execution).

Mandriva has updated lvm2 (privilege escalation).

Pardus has updated phpmyadmin (cross-site scripting) and mysql (multiple vulnerabilities).

by ris at September 06, 2010 05:48 PM

OStatic blogs

SparkleShare Shaping Up to be Slick FOSS Alternative to Dropbox

SparkleShare

If you love Dropbox for easy file sharing across computers but are longing for something free and open source, you're wish is closer to be granted. The team of developers behind the GNU GPLv3-licensed SparkleShare released a beta version of its new app and it's shaping up to be pretty slick.

Though still in its infancy, baby SparkeShare already has some key features you'd want in a file-sharing app. Built-in version control keeps a record of changes made to each file and makes revisions a snap in case of mistakes. Since users host the app on their own servers, the number of files you manage or collaborators you assign to projects are limited only by the size of your disk drive.

App maintainer Hylke Bons says, "Aside from the usual bug fixes and behind the scenes work I mainly added features that increase productivity in the event logs. Not only does it look a lot prettier, each entry in an event log now has a clickable link for easy access to files. It refreshes automatically on new events as well. The Nautilus plugin now has the 'Copy Web Link' context menu item, which makes sharing links a whole lot easier."

Check out the documentation to learn more about SparkleShare, then pick up the free download and check it out for yourself. To keep up on project development, follow the team on Twitter and keep an eye on the wiki.

Related Activities

Related Blog Posts


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Lisa Hoover at September 06, 2010 05:12 PM

O'Reilly radar

"Spontaneous collaboration" and other lessons from the private sector

Gov 2.0 Summit, 2010Cisco CTO Padmasree Warrior's vision for government's future includes smarter cities, real-time communication over national boundaries and more efficient collaboration with citizens.

Warrior touched on these ideas during our recent interview, and she'll expand on many of them during her conversation with Tim O'Reilly at this week's Gov 2.0 Summit. Video highlights from our wide-ranging interview are embedded below.

(Note: This interview was conducted via Cisco's TelePresence. Exporting video from that format presents technical challenges, so portions were recorded with a Flip camera and an iPhone.)

Video and spontaneous collaboration

Given that our interview was conducted via video conferencing technology, I asked Warrior which private-sector tech lessons can be applied to the public world.

"A big lesson that can be transferred from the private sector is kind of already happening. It's how can we use technology, like this [referring to TelePresence], to spontaneously bring ideas together," she said. That goes with the notion of open government, suggested Warrior. "How do you enable citizens to participate in brainstorming sessions, idea collection, in a more spontaneous way? The power of video is that it really allows us to extend the abstract notions of text-based technology and replaces that with much more human way of communicating. It's more natural."



The power of platforms


Platforms such as Amazon's cloud, Apple's App Store, Twitter and Facebook are key parts of the Web 2.0 world. I asked Warrior what government can learn or adopt from these examples.

"The broader access you have to ideas, the stronger the end result will be," said Warrior. "Whatever the platform, the idea is how do get more innovation onto the platform." She sees a clear opportunity for government, but challenges lie in separating signals from noise and applying useful filters so decision makers can enact informed policies.

The evolution of smarter cities

Last month, Warrior shared a link on Twitter about how sensor networks in buildings could use air conditioning ducts as building-wide antennas. Dovetailing with that, I asked her about the evolution of smarter cities.

"If you step back a little bit and think about what's happening, this is going to be a problem that we're all going to face in the next 10-20 years," she said. "There's rapid urbanization going on around the world. We're expecting maybe about 100 new cities, with over 100 million people. New cities, that would be created over the next 10-15 years or so. So the challenge that we all face is how do we enable this urbanization to happen in a different way than we have done in the past. What role can technology play in building smarter cities, cities that are more sustainable, that are greener, that are more efficient?"

That perspective was further expressed by a recent tweet from Warrior, where she shared a piece from Science Daily: "Networks -- not size -- give cities competitive advantage.

On cloud computing, innovation and enterprise collaboration

The last part of our conversation focused on operating in a time of resource scarcity and the use of social software within Cisco itself.

"You don't want to compromise innovation through piping and cost cutting," said Warrior. "I think there will be technology-enabled ways to innovate that the government has to think about as well."

If open government is done properly, according to Warrior, it will increase participation and share the load of the work. "It will drive that speed and the better quality of decisions. If not, it will end up being more bureaucratic, because the noise level is higher than the signal level. I think the key thing in open gov or any kind of open platform is optimizing the signal-to-noise ratio."

Privacy and the transfer of information across communities

One part of the interview that did not make it onto YouTube focused upon the challenges for both government and enterprises that adopt cloud computing. Warrior pointed to the importance addressing the dual issues of authentication and identity, which from her point of view are essential issues. Those are precisely the topics, in fact, that will be focused on at the Internet Identity Workshop in Washington, D.C. this week.

Warrior was thoughtful about the privacy issues that result from digital citizenship and business in the cloud. "There's a difference between identity and community," she said. "I have one identity that's visible to many, being CTO of Cisco. That identity needs to be authentic. I tweet personal things because people want to know who the person is behind the title. At the same time, I belong to a community of Cornell alumnae, to women in tech, to haiku writers, and to southeastern Asian-Americans. You have to know what community is appropriate to share information with and how."

The issue, explained Warrior, is the appropriate transferability of information from one community to another. That's at the heart of privacy concerns about Facebook or Google's initial missteps with Buzz. As government considers cloud computing models, getting privacy right there will be even more important.

by Alex Howard at September 06, 2010 05:00 PM

GigaOM by Om Malik

Documents To Go

Research in Motion, the Waterloo, Canada-based company behind the iconic Blackberry smartphones is rumored to have snapped up DataViz, a Milford, CT.-based company that is well known for making mobile productivity suite, Documents to Go. The CT-based company makes this extremely popular app for all major smartphone platforms including iPhone, Android, Blackberry and Nokia’s Maemo. It also supports the iPad. DataViz recently suspended work on the WebOS version of the suite.

The rumors of the acquisition first appeared on the Crackberry Blog which apparently has received confirmation from multiple sources for the deal. RIM is said to have paid $50 million. In addition, they point out on LinkedIn many of the DataViz employees are now listing RIM as their employer. Both RIM and DataViz have not made any official announcements. I am waiting to hear back from RIM and DataViz, though I am not expecting any one to respond, considering today is a holiday on account of the Labor Day.

If the rumors are indeed true, then RIM has made a great buy. Unlike the Cellmania purchase, this one actually makes a lot of sense and I would put it right next to last year’s acquisition of WebKit browser maker, Torch Mobile.

Given how much of Blackberry is used inside the enterprise, it makes perfect sense for RIM to add apps for the enterprise to its arsenal. The company despite a higher install base than some of its newer rivals has struggled to capture the imagination of developers. Many of the best Blackberry apps are infact being made by RIM itself. DataViz’s products would also be ideally suited for the company’s rumored iPad competitor.

Related research on GigaOM Pro (sub. req’d): To Win In the Mobile Market, Focus On Consumer


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Om Malik at September 06, 2010 04:51 PM

OStatic blogs

KeePass 2.13 Released, Sports Big Batch of New Features

KeePass

If you're not familiar with KeePass, a terrific free, open source password manager for Windows, you really ought to get to know it. In fact, it's on our own Sam Dean's list of 10 free OSS security applications that you can trust. Already a robust application for locking down all your passwords, it just got better now that the development team released KeePass 2.13 with a batch of new features and improvements.

The app's database stores all your password and registration in one place, and secures them with a single master password that accessible to no one but the user. In fact, KeePass' encryption is so strong that even if you used every computer in the world to simultaneously attack it's database, "decrypting it would take longer than the age of the universe."

The password database is stored in a single file that's easy to carry on a thumbdrive or burn to a CD, or easily export data to one of several formats including TXT, HTML, XML, and CSV. KeePass can also read data imported from other popular password managers like Password Safe v2 and Password Agent.

Some of the new features in this version include:

* Global auto-type (using a system-wide hot key) is now possible on Unix-like systems

* Added IPC functionality for Unix-like systems

* If IO credentials are stored, they are now obfuscated

* Tag lists are sorted alphabetically now

* Password quality estimation algorithm: added check for about 1500 most common passwords

If you're using a different version of KeePass, developers recommend you upgrade to version 2.13 now. The app is developed for Windows, but there are several unofficial contributed builds for Linux, Mac OS X, BlackBerry, Android, and more.

Related Activities

Related Software

Related Blog Posts


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Lisa Hoover at September 06, 2010 04:32 PM

Linux Weekly News

Systemd Test Day on Tuesday 2010/09/07

Fedora will be holding a Systemd test day on September 7, 2010. "This week's Test Day, which will take place on Tuesday 2010/09/07 rather than the more usual Thursday, is on systemd, so it's a very important one! It will also serve at least two functions: as usual, the testing will help us to improve the code so that if it does go into the final Fedora 14 release it will work as well as possible, but the Fedora steering committee will also be using the results of the Test Day to help inform their final decision as to whether to go ahead with systemd for the Beta and final release, or whether to revert to upstart. So there's a lot riding on this Test Day."

by ris at September 06, 2010 04:29 PM

GDB 7.2 released

Version 7.2 of the GDB debugger is out. New features include support for the D language, some C++ improvements, better Python support, better tracepoint support, and more; see the announcement for the details.

by corbet at September 06, 2010 04:10 PM

GigaOM by Om Malik

pinggom

Last week, at its usual September iPod product refresh, Apple rolled out Ping, and critics simultaneously questioned whether or not Apple could build a social network to challenge the likes of Facebook and Twitter. As I discuss in my weekly column at GigaOM Pro, the real question isn’t if Apple can, but rather, if the folks in Cupertino even want to pursue such a move.

As it stands now, Ping is explicitly about selling music on the iTunes store. Om thinks it foreshadows the future of social commerce, but where else could Apple take Ping, and how far?

Some analysts describe social networking as air, but perhaps the more relevant metaphor is electricity. In this view, companies and sites tap into social networking to create applications or experiences. Right now, Apple is treating social media as electricity to fuel its own shopping and communications applications.

Apple makes its money by selling products and “renting” its distribution channel. It likely won’t hire an advertising sales force, and Apple’s Me.com is a weak collection of fee-based services. I suspect Apple’s more comfortable creating social networking features that enhance its products and marketplaces, rather than building out a free-standing social network.

Standalone social networks like Facebook, Twitter and LinkedIn, then, probably won’t face Apple as a head-to-head competitor for their audiences, advertisers or what they deliver as their core user experience. Apple doesn’t appear to be interested in building a general-purpose social network, a short message broadcasting service, or a professional connections network. MySpace is way ahead of Apple in gathering artists’ pages and a social music audience, but Apple’s ability to drive sales makes it a fierce competitor for label attention.

Those companies, and others like Google, Yahoo and Microsoft, who aspire to provide social media APIs, services and even infrastructure, should cultivate, rather than compete with Apple, especially if they want to reach Apple’s customers. That means they should license or, if Apple’s in its usual DIY mode, integrate their own social networking technologies with Apple’s. By the time you read this, Ping users may be able to find their friends via Facebook Connect.

Read the full post here.


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by David Card at September 06, 2010 04:00 PM

Linux Weekly News

MWR Labs: Assessing the Tux Strength

The MWR Labs group at MWR Info Security is running a series of articles comparing Linux distributions from a security point of view. Part 1: user space memory protection looks at protection against memory corruption attacks, while Part 2 - into the kernel examines kernel security settings. "The notable exceptions in the results are Fedora and Ubuntu. Both distributions do not allow the ability to write code to a certain memory region and then execute it. This can be observed from the results of the first five tests. Fedora goes one step further and also prevents the bss, data and heap sections from being marked as executable using the 'mprotect' system call. It should be noted that there would still be numerous other memory regions where an attacker could upload their code and then use the 'mprotect' function to mark it as executable."

by corbet at September 06, 2010 03:51 PM

Linux Today

Chrome August's big winner as Internet Explorer resumes slide

ars Technica: "As browser competition continues to heat up, 2010 looks like the year when the market was repeatedly disrupted."

September 06, 2010 03:04 PM

Linux Weekly News

Stable kernel 2.4.37.10

The 2.4 kernel lives - for a little while longer, at least. Willy Tarreau has just released the 2.4.37.10 update, with a small set of important fixes. This might just be the last update in this series, unless some sort of important fix comes in. "If nothing happens before September 2011, it's possible that there won't be any 2.4.37.11 at all. By that time, the 2.6 kernel will have been available for almost 8 years, this should have been enough for anyone to have a look at it. Users now have one year to migrate or to report critical bugs. I think that's an honest deal." See the announcement for the full description of his planned policy.

by corbet at September 06, 2010 02:10 PM

GigaOM by Om Malik

Summer

On this Labor Day, as we in the tech world gear up for the busy fall season, I’m struck by the fact that there really didn’t seem to be a summer slowdown this year. From my perspective, the funding rounds, product releases, acquisitions, IPO filings and corporate scandals didn’t take a vacation. Perhaps there were fewer tech conferences in August, but it’s not like we had time to take off work early for the beach.

Maybe it’s just that it was frigidly cold in San Francisco for most of the last few months. Or maybe it’s that those dang super angels haven’t heard about how in August all venture capitalists are supposed to take their kids to the South of France — or at least Tahoe. Not that I’m complaining; I much prefer writing about real news than making something up!

In case you did take off to an island somewhere, or if you want to reminisce about how gosh-darn hard we all worked for the last three months, here’s a sprinkling of highlights from our top summer stories:

Photo courtesy Flickr user GViciano.


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Liz Gannes at September 06, 2010 02:00 PM

O'Reilly radar

Bringing open government to courts

yu.harlan.jpgAs court records increasingly become digitized, unexpected consequences will result from that evolution. It's critical to be thinking through the authentication, cost and privacy issues before we get there.

Harlan Yu, a Princeton computer scientist, worked with a team to create online tools that enable free and open access to court records that highlight the need for more awareness. My interview with Yu this summer was a reminder that the state of open government is both further advanced and more muddled than the public realizes. As with so many issues, it's not necessarily about technology itself. Effective policy will be founded upon understanding the ways that people can and will interact with platforms. Although applying open government principles to public access for court documents is a little dry for the general public, the ramifications of digital records being published online means the issue deserves more sunlight. A condensed version of our interview follows.

Your open government work has focused improving public access to court records in the on PACER system. PACER stands for "Public Access to Court Electronic Records" but the reality of public access is more complicated. What's the history of your involvement with this aspect of open government?

Back in February of last year, Steve Schultze, who was at the time at the Berkman Center, was giving a round of talks about access to court materials on PACER. He came to CITP in February to give a talk with one of his colleagues. I had never heard of PACER before, but I went to Steve's talk and learned about how the federal government provides these documents that form the basis of our common law. I was appalled that these public records were essentially being sold to the public at the detriment to our democracy.

What did you propose to Schultze to fix this situation?

We thought there was a way that you could automatically allow PACER users to share documents that were legitimately purchased from the PACER system. Because these are public records, once a legitimate user pays for a document, they should be able to share it on their blog, send it to their friend, post it online, or do whatever they want with it. We decided to venture out and build a [Firefox] plug-in called RECAP that essentially automatically crowdsources the purchase of PACER documents.

Who else was involved in building RECAP?

Gov 2.0 Summit, 2010We worked with the Internet Archive and with Carl Malamud at public.resource.org. We built a system where users could download the RECAP plug-in and install it. While they used PACER, any time they purchased a docket or a PDF, whether it was a brief, an opinion or any motion, it automatically gets uploaded into our central repository in the background.

The quid pro quo in that, as you're using the RECAP plug-in, if we already have a document that has been uploaded by another user, that gets shown to you in PACER to say, "Hey, we already have a copy. Instead of purchasing another copy for $.08 or whatever it'll cost you, just get it from us for free."

We now have about 2.2 million PACER documents in our system, which is actually a small fraction of the total number of documents in the PACER system. The PACER administrative office claims that there are about 500 million documents in PACER, with 5 million being added every month. So 2.2 million is actually a pretty small number of documents, by percentage.

We think that we have a lot of the most commonly accessed documents. For the court cases that have high visibility, those are the ones that people access over and over. So we don't have a lot of "long tail," but we have a lot of the ones that are most commonly used.

Are there privacy and security considerations here? Why does the concept of "practical security" matter to open government?

We'd like to make all of these documents freely available to the public. We've found a couple of different barriers to offering free and open public access. The biggest one is definitely privacy. When an attorney files a brief [in federal courts], they need to ensure that sensitive information is redacted. Whether it's a Social Security number, the name of a minor, bank account numbers, all of these things need to be redacted before the public filing, so when they put it on PACER, it can't be mined for this private information. In the past, the courts themselves haven't been very vigilant in making sure their own rules were properly applied. That's mainly because of "practical obscurity." These documents were behind this paywall, or you had to go to the courts to actually get a copy. The documents weren't just freely available on Google. The worry about privacy was not as significant, because even if there were a Social Security number, it wouldn't be widely distributed. People didn't care so much about the privacy implications.

So a condition of "privacy by obscurity" persisted?

Exactly. The information's out there publicly in public record, but it's practically obscure from public view. So now we have a lot of these PDF documents, but there's actually a number of these documents that have private information, like Social Security numbers, the names of minors or names of informants. Just going out and publishing these documents on Google isn't necessarily the best and most moral thing to do.

I think one of the consequences of RECAP, Carl's work and our work in trying to get these documents online is the realization that eventually all of these documents will be made public. The courts need to be a lot more serious about applying their own rules in their own courts to protect the privacy of citizens. The main problem is that in the past, even though these records weren't available publicly and made freely available, there were already entities in the courtrooms essentially mining this information. For example, in bankruptcy cases, there were already data aggregators looking through court records everyday, finding Social Security numbers, and adding this information into people's dossier but out of the view of the public. Bringing this privacy issue to the forefront, even if these documents aren't yet publicly available, will make a big impact on protecting privacy of citizens who are involved in court cases.

As court records become more public, what will that mean for citizens?

If somebody sues you -- and it's a claim that eventually is unfounded -- that might end up in some dossier and the information may be incorrect. With these 2.2 million documents, we try to make them as publicly accessible as possible without harming the privacy of citizens. Last month, we came out with the RECAP Archive, which is essentially a search interface for our database of documents. We now allow users to search full text across just the metadata associated with the case. You can search across all the documents we had for case title, case number or judge. If there's a summary of the documents, you can search over all of the metadata on the docket. We haven't enabled full text search of the actual PDF or of the brief yet because that's where a lot of the PII is going to be found.

What about the cost of making court records available? Is there a rationale for charging for access?

The other issue with PACER -- and it's hard to ignore -- is cost. The reason why the courts charge money for these public domain documents is that Congress authorized them to. In the 2002 E-Government Act, Congress essentially said that they"re allowed to charge you their fees to recoup the cost of running this public access system, only to the extent necessary to recoup these costs. The courts determined at the time that that should be $0.07 a page and eventually upped that per page access rate to $0.08 per page. But if you look at their budgeting documents, we've found that they actually charge a lot more than the expense necessary to provide these documents. My colleague, Steve Schultze, has done a ton of work digging into the federal judiciary budget. We found that about $21 million every year looks like it's being spent directly on running the PACER systems. That includes networking, running servers, or directly to providing public access through PACER. Their revenue in 2010 is projected to be -- I believe -- $94 million. So there"s a $73 million difference this year in the amount of money that they"re collecting versus the amount of money that they're spending on public access. That $73 million difference is thrown into this thing called the Judiciary Information Technology Fund or the JIT Fund.

The JIT Fund is being used on other court technology projects, like flat screen monitors, telecommunications, embeddable microphones in court benches. I'm not opposed to these projects being funded and more technologies in courtrooms, but these projects are being funded at the expense of public access to the law, including the ability for researchers and others interested in our judicial process to access and study how the judicial process works, which I think is highly detrimental to society.

You've offered a thorough walkthrough of many of the issues that were raised at the Law.gov workshop earlier this year. What is the next step in opening up the court system in a way that the American people can find utility from those efforts?

I think the ball is essentially in Congress' court, so to speak. The courts need to work together with Congress to find the right appropriation structure such that PACER is funded not by user feeds but can be supported by general appropriations. Only in that case could the courts take down that user pay wall and allow all of these documents to be freely available and accessible. It's important to look at exactly how much money Congress needs to appropriate to the courts to actually run the system. I think $21 million isn't necessarily the right number, even though that"s how much they spend today for a couple of reasons.

Carl has done a bunch of FOIA requests to all of the individual executive agencies and found, for example, that DOJ pays the judiciary $4 million ever year to access cases. That"s probably true for a lot of the other agencies or for Congress. They pay the courts to access PACER. So a lot of that money is already coming from general appropriation where taxpayer money goes to DOJ, $4 million and then that is paid out to the courts.

If Congress were able to redirect that money directly, the courts would get that money directly and that would go a long way in making up this $21 million. In addition, the amount of money to run the payment infrastructure, to keep track of user accounts, to process bills, to send out letters, to collect the fees, I"m sure probably would cost a couple million dollars, too. If you take down the pay wall, that whole system doesn't even need to be run.

From a policy perspective, I think it's important for Congress and the courts to look into how much money is being sent by using taxpayer money already on running PACER and then directly appropriating that money, along with however, more is necessary on top of that if there's a shortfall to fund the system. Once enough funding is available, then you can take down the pay wall and keep the system running.

There are privacy issues that we need to deal with, certainly in bankruptcy cases, there"s a lot more private information that's left un-redacted, in the regular district appeals courts, appellate courts, probably a bit less. But there are definitely issues that we need to talk about.

What are you focusing on in your doctoral work at Princeton?

On the open government front, I've been looking into a variety of topics in privacy and authentication of court records. I think that's extremely important, especially as the focus is on publishing raw data and third-party reuse of data, in terms of re-displaying government data through third parties and intermediaries. It's also important that governments start to focus on the authentication of government records.

By authentication, I mean actual cryptographic digital signatures that third parties can use to verify that whatever dataset that they downloaded, whether it's from the government directly or from another third party, is actually authentic and numbers within the data that haven't been perturbed or modified, either maliciously or accidentally. I think those are two issues definitely will be increasingly important in the open government world.

What will your talk on "Government Data and the Invisible Hand" at the Gov 2.0 Summit examine?

When we try to do open government, government tries to look at the data that they have and try to publish it. Then they get to a certain technological limit, where an important dataset that they want to publish is on paper file or is in a digital record but not in any machine-parsable way. Or records are available in some machine-parsable way, but there are privacy problems. When we talked about open government and innovation, I think a lot of people have been focusing on user-facing innovation, where the data had been published and the public goes out and takes that and makes user-facing interfaces.

There's also back end innovation, where tools that enable government to better build this platform and sharpen this platform make the front-end innovation possible. These things include better redaction tools for privacy that make it more efficient for government to find private information in their public records. Or tools that help government source data at its creation in machine-readable formats, rather than doing it the same old way and then having some very complex and leaky process for converting Word documents or other non-parsable documents into machine-parsable formats. I think there's a lot of innovation that needs to happen in the tools that government can use to better provide the open platform itself.

by Alex Howard at September 06, 2010 01:11 PM

Commercial Open Source Software

Open Innovation Awards 2010: List of Finalists

Oct
1
2:00 pm

The Jury of the Open Innovation Awards 2010 - the international competition for Open Source projects being organized as part of the Open World Forum in Paris on 1 October 2010 – has published the list of finalists.

The award is organized by the Open World Forum with operational support being provided by the GT Logiciel Libre (free software) of the Systematic competitiveness cluster. The full list of the nominated companies and jury’s members are reported below.

The nominated companies:

  • Activeon (France)
  • Disruptive Innovations (France)
  • Conecta (Italy)
  • Hedera Technology (France)
  • iceScrum (France)
  • Jaspersoft (USA)
  • Mozilla (USA)
  • Obeo (France)
  • Pentaho (USA)
  • O-Engine (China)
  • Pilot Systems (France)
  • Talend (France)
  • XWiki (France)

The jury:

  • Larry Augustin
  • Jean-François Caenen
  • Jean-Marie Chauvet
  • Stefane Fermigier
  • Jean–François Gallouin
  • Roberto Galoppini
  • Thierry Koerlen
  • Jean-Noel Olivier
  • Bruno Pinna
  • Alain Revah

The finalists will compete against each other on 1 October 2010 from 2:00pm to 4:00pm at the Open World Forum (Demo Cup).

by Roberto Galoppini at September 06, 2010 12:05 PM

Linux Today

Adopting Enterprise Open Source Software

OStatic: "I did the upgrade, and made the meeting in time to get a cup of coffee on the way. This is how upgrades should work for everything."

September 06, 2010 11:04 AM

O'Reilly radar

Four short links: 6 September 2010

  1. Akihabara (Github) -- open source (GPL2 and MIT dual-licensed) HTML5/Javascript engine for classic arcade games. (via chadfowler on Twitter)
  2. Eureka Streams -- open sourced Java app for enterprise Twitter-like activity: build a profile, join groups, post updates, subscribe to updates from individuals or groups. (via dlpeters on Twitter)
  3. Open Microbiome -- hoping to build open tools, standard samples, data, and metadata for analysis of the microbiome (all the microorganisms that live in, on, and with macroorganisms like us). Early days, but glad to see people are already thinking of building this research open from the ground up. And if you think sequencing the human genome gave us a lot of data we struggle to find patterns in, wait until you start including microorganisms: we have 10x as many bacteria in us as we have cells and the species variety is vast. (via phylogenomics on Twitter)
  4. Habits of Mathematical Minds -- fantastic list of skills and approaches that are hallmarks of many successful minds, not just in mathematics. (via ddmeyer on Twitter)

by Nat Torkington at September 06, 2010 10:00 AM

Linux Today

Using a Bamboo Tablet with Ubuntu 10.04

Linux.com: "If any of these are true you might be looking at one of the many drawing tablets that can be connected, via USB, to your laptop or PC. These tablets make for a far superior graphic experience, giving the artist much more control over the cursor than with a standard mouse."

September 06, 2010 03:04 AM

September 05, 2010

Linux Today

Clonezilla Live

Linux Journal: "Clonezilla is a bootable CDROM designed for partition backup and restoration. Unlike SystemRescueCD, Clonezilla Live doesn't contain an array of utilities, rather it is a single, focused tool. However, if you're interested in simply backing up or restoring whole partitions to or from files, or copying one partition onto another, Clonezilla might be just what you're looking for."

September 05, 2010 11:06 PM

GigaOM by Om Malik

Chris Albrecht

Just in time for the Labor Day weekend, we bring you the second installment of our web TV series BBQ&A. In this episode, oDesk CEO Gary Swart turned up the heat with a homemade feast fit for a Silicon Valley king, featuring a super-easy cedar plank salmon, teriyaki tri-tip, corn salad, Caprese salad, and grilled asparagus. (We ran out of time before he could grill up his nectarines to put on the homemade marscapone ice cream).

(Video editor’s note: It was a scorching hot, cloudless day in Palo Alto, Calif. and this supernova-like illumination created harsh shadows, made us squint to save our eyes, and caused some serious sweat.)

Expert and noob grillers alike should definitely try out Swart’s salmon recipe. Not only is it delectable, but it’s easy to make, taking just 20 mins to cook. (I made it on my own last weekend.)

We want to also give a big thank you to The Private Bank of the Peninsula for letting us use their fantastic rooftop patio. If you’re a tech exec who likes to cook and wants to be a guest on our show, email my first name at gigaom dot com.

Cedar plank salmon

1 cedar plank
2 pound salmon fillet
Salt and freshly ground black pepper
6 tablespoons Dijon mustard
6 tablespoons brown sugar

Soak cedar plank in salted water for 2 hours. Remove skin and any remaining bones from salmon fillet. Rinse under cold running water and pat dry with clean paper towels. Place the salmon on the cedar plank and generously salt and pepper, carefully spread the mustard over the top and sides and sprinkle brown sugar on top over the mustard.

Set grill to medium high and place the cedar plank in the center of the hot grate, away from the heat. Cover the grill and cook salmon through, around 20 to 30 minutes to an internal temperature to 135 degrees. Transfer the salmon and plank to a platter and serve right off the plank.

Corn Salad

6 ears of sweet white corn, shucked
1 cup diced red onion (1 medium onion)
1 cup red pepper (1-1/2 peppers)
1 large diced mango
1 large diced & seeded cucumber
5 tablespoons cider vinegar
5 tablespoons good olive oil
1 teaspoon kosher salt
1/2 teaspoon freshly ground black pepper
1/2 cup chopped fresh cilantro leaves
1/8 cup chopped mint

In a large pot of boiling salted water, cook the corn for 3 to 4 minutes, just until the starchiness is gone. Drain and place the cobs in ice water to stop the cooking process. Once the corn is cool, cut the kernels off close to the cob.

In a large bowl, mix the corn, onion, peppers, cucumber, vinegar, olive oil, salt, pepper, cilantro and mint. Just before serving add the diced mango (to avoid the mango getting too soggy). Taste for seasonings and serve cold or at room temperature. Salad can be made the night before to let the ingredients marinate.

Teriyaki Tri Tip

1 cup soy sauce
1/4 cup olive oil
1/3 cup Japanese sweet rice wine
4 1/2 teaspoons rice vinegar
1 teaspoon sesame oil
1/4 cup white sugar
5 cloves garlic, peeled and chopped
1 tablespoon minced fresh ginger
1 dash red pepper flakes
black pepper to taste
ground black pepper to taste
4 pounds beef tri tip, cut into 1 inch slices

Bring rice wine to a boil in a saucepan over high heat. Add in the soy sauce, olive oil, sesame oil, sugar, garlic, ginger, pepper flakes, and pepper. Reduce heat and let simmer for 15 minutes. Remove from heat and let the marinade cool. Marinade can be made the night before and stored in the refrigerator to cool. Note: there are some decent commercial teriyaki sauces that you can buy to reduce prep time on this dish.

Once the marinade is cooled in the refrigerator, place the beef tri tip in the marinade, being sure to cover as much of the meat as possible. Cover, and marinate in the refrigerator at least 4 hours.

Preheat an outdoor grill for high heat, and lightly oil grate. Grill the beef 3 to 6 minutes per side, or to desired temperature. Discard the remaining marinade.


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

Just in time for the Labor Day weekend, we bring you the second installment of our web TV series BBQ&A. In this episode, oDesk CEO Gary Swart turned up the heat with a homemade feast fit for a Silicon Valley king.

by Chris Albrecht at September 05, 2010 11:00 PM

OStatic blogs

The Many Faces of Linux

penguin

Linux may have started out small, but it’s grown by leaps and bounds. Today, Linux can be found on everything from a home wireless router to the gigantic mainframe in the data center. Although the spirit of openness surrounds Linux, thanks in part to the GPL, distinct communities have sprung up to support the different environments, each with a slightly different take on what it means to be in the Linux community.

Desktop The most famous form of Linux, the type that used to get the press, has got to be Linux on the desktop. Supporters of the Linux desktop range from those who value the open source license above all else (the same type of Linux user who posts words like FREEDOM in ALL CAPS in online flame wars), to technically inclined people, to the simply curious. I’ve personally been following the Linux desktop “movement” since 1999, back when Linux Magazine was “Chronicling the Revolution”, a reference to Linux’s impending superiority over Windows as the operating system of choice for personal computers. Year after year, Linux has gotten better, but dominance on the desktop remains elusive. For many users, this is not a problem. They have their customized Debian desktop just the way they like it, thank you very much, and don’t need anyone’s approval for it. For others though, recent developments in the next category of Linux users has people asking, “Does Linux Need the Desktop?”

Mobile Mobile Linux has exploded in the past year, thanks to Google and their Android operating system. With Android, Linux is finally able to reach the casual user audience that was so difficult to reach on the desktop. Android is the top competitor to Apple’s iPhone, and possibly soon to be with the iPad as well. Android has done the one thing that was seemingly impossible on the desktop, surpass Microsoft on a consumer device. However, the freewheeling development of the desktop doesn’t perfectly equate to mobile devices. Carriers retain a lot of control over what you can and can not do to your phone, and even approved apps still need to play by Google’s (admittedly lax) rules.

Server It is here, in the datacenter, that the true domination of Linux is apparent. Before Android, there were really two main camps of Linux: servers and desktops. Many distributions support both, and some even have a different ISOs to download for the server. Linux can provide all of the services of Windows, all of the power of Unix, and the web hosting genius of the LAMP stack that has really pushed the platform forward. It’s perfectly reasonable to argue that many of the latest revolutions of the web would not have been feasible, or at least more expensive, under Unix or Windows. As a sysadmin, I obviously fall squarely in this camp, and while I believe that Linux and open source software is the best choice for the data center, I continue to be skeptical of it’s performance on other platforms. That is, of course, with the notable exception of the next form of Linux.

Embedded I would be amiss not to mention Linux in firmware, and the several projects that exist to replace proprietary firmware with open source Linux versions. Years ago I updated my wireless router with the DD-WRT firmware, and had absolutely zero problems with it. Other projects, like Coreboot aim to replace the BIOS in PCs with open source systems that have more options. Linux has even made its way into devices that in no way resemble their desktop or server cousins. Linux is powering everything from televisions to cameras to GPS units, and even the popular Kindle from Amazon.

When talking about Linux, it helps to distinguish what kind of Linux you are referring to. The core Linux kernel is amazingly capable and flexible, and has made its way into as many devices as there are CPUs to power them. It’s important to take note that Linux on the server is a world of difference away from Linux on the desktop, in both purpose, use, and functionality.

Related Activities

Related Software

Related Blog Posts


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Jon Buys at September 05, 2010 08:44 PM

Linux Today

Are You Intimidated By Breakfast Cereal?

O'Reilly Broadcast: "The trouble with Linux: there's too much choice. To Mr. Morrison and all the others who have written articles like this one I say: Hogwash!"

September 05, 2010 07:06 PM

Announcing WriteType 1.0.98

A High School Student's Views on Software Freedom: "The next version of WriteType (1.0.98) is now available for download! WriteType is a word processor designed to make typing easier and more efficient for young students and students with disabilities."

September 05, 2010 03:06 PM

How to install PC-BSD on an encrypted ZFS file system

LinuxBSDos: "As a desktop distribution built atop FreeBSD, PC-BSD makes available to the desktop user all the cool technologies inside FreeBSD. One of those cool technologies is ZFS, the Zettabyte File System"

September 05, 2010 11:06 AM

Ubuntu 10.10 beta arrives with new netbook UI

ars Technica: "Canonical has announced the availability of the Ubuntu 10.10 beta release. The new version of the popular Linux distribution, codenamed Maverick Meerkat, is scheduled for final release in October. It brings some noteworthy user interface improvements and updated software."

September 05, 2010 07:06 AM

Webcam server on Linux 2

Linuxaria: "My guide on how to make a webcam server has had a lot of feedback and so I decided to give two other methods to achieve it with two different software webcam-server and ZoneMinder"

September 05, 2010 03:06 AM

September 04, 2010

GigaOM by Om Malik

gigaguest

While settling on a standard big data stack is deeply important to the big data industry as a whole, I’m nonetheless questioning the operational and competitive consequences for companies who choose to buy into this standard without first considering the value of building a proprietary solution.


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Shion Deysarkar at September 04, 2010 11:09 PM

Linux Today

PS3 hack source code published

The H Open: "This apparently allows a programmable USB development board with an AT90USB microcontroller to be used to circumvent the PS3's security systems and execute unsigned code."

September 04, 2010 11:03 PM

GigaOM by Om Malik

istock_000003107697small2

Nova Spivack’s recent GigaOM post, “Trailmeme and the Web of Intent,” highlights the growing content clutter problem on the web, but frames the solution set too narrowly and too far into the future. In fact, more robust content filtering tools and the Web of Intent will arrive sooner than you think, based on the implicit messages in users’ actions.

Solutions like Trailmeme that help consumers more easily save, tag, annotate, inter-link and share related content as a way to better filter the web sound promising, but we shouldn’t put all the pressure on crowd-sourcing solutions that require consumers to clean up the entire stream.

The Web of Intent will be largely driven by consumers’ actions and interests.  It will be based on implicit actions consumers take around what is most important to them. There’s an emerging opportunity for content publishers (and the publishing technologies they rely upon) to dramatically improve how they filter the stream for the consumers they serve. Once they do, consumers will embrace these improved, personalized content offerings and will in turn provide valuable feedback and insight through the actions they take with the content offered. Here are a few publishing trends that will accelerate the Web of Intent:

  1. Search and publishing tools will become more integrated, offering new forms of publishing flexibility that marry a publisher’s originally authored content assets with the best related content from the aggregated and real-time web.
  2. Search and publishing integration will help editors more easily monitor the web and curate new content packages across different content types, including articles, blog posts, tweets, photos and video.
  3. Publishers will start to produce curated, topical or thematic content “feeds” for their target audiences. For example, consumers will be able to subscribe to curated sports feeds for the latest news about their favorite teams or athletes or gadget feeds covering digital cameras or iPad news.
  4. Publishers will also offer more engaging (and valued) user experiences for consumers who “opt-in” to these personalized, filtered feeds providing convenient updates wherever consumers go. Think a better version of Google Alerts — curated by skilled editors from your favorite publisher and available anywhere (Facebook, Twitter, MyYahoo, iPad, iPhone etc.).
  5. Consumers will be able to customize these feeds across topics or stories, prioritize sources, receive recommendations and discover new content via their friends and social graph. New forms of social sharing (community) will emerge organized around consumer’s interests and the curated feeds they subscribe to.
  6. A few years ago when I was at Edmunds.com, we implemented an early form of the Web of Intent. For example, if a consumer was interested in a Sedan or BMW 3 Series they would click a link to get more information. As the publisher, we started to understand their intent through their implicit actions and fashioned a dynamic content and monetization experience designed to satisfy their specific interest. To support this, we had to significantly re-architect the way we thought about the design of the site and our entire content and advertising operations to organize around the consumer’s interests. We built everything ourselves, and that investment paid off as the site became the top auto research destination on the web and we significantly increased our revenue per user.

    As we look ahead, next-generation content publishing tools will make this transition much easier for publishers. They will be able to quickly transform their content operations beyond articles and blog posts into data and interest-centric publishing structures that allow consumers to follow topics and ongoing stories of interest. As consumers follow their favorite topics or stories, publishers will be able to build a Web of Intent rich in data and profiling based on their audiences’ interests. These interests will offer newer and more robust targeting opportunities and will ultimately provide publishers new opportunities for monetization beyond pure advertising.

    The good news for consumers is a number of large publishers are already actively working on these problems and are in the process of redesigning or re-launching their websites to make their sites more “intent-friendly”. Additionally, innovative tech companies are emerging such as Magnify.net in video curation or my6sense which help create personalized content streams.

    The Web of Intent will be here sooner than you think.

    Matthew Kumin is the former EVP, Media for Edmunds.com and co-founder and CEO of PublishThis, a next-gen content publishing platform.


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Matthew Kumin at September 04, 2010 10:00 PM

OStatic blogs

Maturing as a Linux Systems Administrator

SAGE

We had a visit from a vendor the other day, one who sells high end Unix hardware. The meeting was informative, and overall went well. While I was walking the vendor out of the building, he turned to me and told me how learning his version of Unix would really help me as I matured in my profession. He continued to say how any company who needed a Linux admin could just grab a kid right out of college, because, according to him, it was no big deal. I understand his position as a vendor, and wanting to push his proprietary software. I also understand that he’s right that learning new things is good for any career, but he’s completely wrong on his perception of Linux.

Linux is a complicated operating system, but one that has grown exceptionally fast. After dealing with Linux for over a decade, you come to the understanding that Linux, like ogres and onions, has layers. As soon as you think you have a good solid understanding of something, you can peel it back and look at the layers underneath, and realize that much to learn, still you have. An understanding of the deeper complexities of a load-balanced Linux cluster is not something you just pick up over night. It takes knowledge of networking, the TCP/IP stack, arp caching, reverse arp, scripting, system utilization and fault isolation. It is true that a novice administrator can follow an instruction book to install the necessary software and get a cluster up and running, but it takes a senior sysadmin to fix it when it breaks. I’ve questioned before the future validity of systems administrators, but I’m confident now that choosing to be a Linux sysadmin is a great career path, if you dedicate yourself to continually growing.

The best place to turn for information concerning the field of systems administration is SAGE, the Systems Administrators Guild. SAGE is a special interest group of USENIX, the Advanced Computing Systems Association. SAGE publishes professional pamphlets and articles, and has defined a set of criteria for advancing from novice to senior sysadmin roles. SAGE also publishes a code of ethics encouraging members to maintain high standards of personal integrity, especially when dealing with sticky issues like privacy, social responsibility and law. One of the best benefits of a SAGE membership is access to the mailing list, which gives you a group of peers to bounce questions off and discuss relevant matters in open source. Joining SAGE is a great step in furthering your career as a sysadmin, and one that shows you take your field seriously.

Finally, one of the greatest signs of a mature systems administrator, no matter what platform he specializes in, is patience. Admittedly, this is an area I’m still working on, and probably will be for the rest of my life. It takes patience to write good documentation, it takes patience to throughly test a system before it’s put into production, it takes patience to ensure systems are patched on time, and that the patches are tested before they are put into production. It takes patience to know that the cool new thing might not be whats best for your environment. It takes patience to recognize that voice in the back of your head that says something that you are looking at is not quite right. And, it takes patience to smile and nod to vendors who speak condescendingly about your profession.

Related Activities

Related Software

Related Blog Posts


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Jon Buys at September 04, 2010 07:14 PM

Linux Today

Oracle offers student coders free access to JavaOne

Computerworld: "Oracle is hoping to entice a younger generation of Java programmers into its fold by offering students complimentary admission to the upcoming JavaOne and Oracle Develop conferences."

September 04, 2010 07:06 PM

Your Linux system keeps falling and it can't get up

Stubborn Tech Problem Solving: "Once in a while a Linux PC technician will encounter a system that has problems with lockups (a.k.a. hanging or freezing). Sometimes it is failing hardware but other times it's a software problem. Here are the common causes for this and how to identify which is the source of your problems."

September 04, 2010 05:06 PM

Weekend Project: Serve Up Your Own OpenID with Open Source Tools

Linux.com: "Last weekend, we looked at how to enable your Web site to accept OpenID logins. But accepting OpenID authentication is only half of the issue — if you care about online privacy and identity, the chances are that you will want to control your own OpenID."

September 04, 2010 03:06 PM

GigaOM by Om Malik

gigaom_icon_google-android

The collective groans of supervisors all over the world was heard this week as the phenomenally successful Angry Birds game was released for the Android platform. Angry Birds is the addictive game that’s been setting the iPhone (and iPad) world on fire. The Android release is an early beta, available in the Android Market as a free download. Rovio Mobile, producer of the game, had intended to conduct a closed Android beta, but pleas from prospective players convinced Rovio to release it publicly. Angry Birds has been wildly successful for a mobile platform game, with 7 million paid players and 11 million unpaid. Don’t miss Om Malik’s interview with the game’s creators to see where Angry Birds is headed.

The giant IFA electronics show is underway in Germany, and as expected, Android tablets are on display. Korean electronics maker Samsung unveiled the Galaxy Tab, and it appears to be a serious competitor for the iPad. The 7-inch tablet is stuffed with the same electronics found inside Samsung’s popular Galaxy S phone line, and has software optimized for the larger Tab screen. Pricing hasn’t been announced by the company, but it will be available in Europe shortly and the U.S. later this year.

Android phone owners wishing there was a seamless way to work with desktop music files similar to the iPhone/ iTunes method should take a look at doubleTwist. We took it for a spin and found it to be a full-featured way to sync Android phones to iTunes music, photos and movies. The program has integration with the Amazon MP3 store for music and the Android Market for working with Android apps. Perhaps best of all, doubleTwist is free.

Related GigaOM Pro content (sub req’d): Forget, Syncing, Let’s Put Music in the Cloud!


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by James Kendrick at September 04, 2010 01:00 PM

Linux Today

Systemd and Fedora 14

LWN.net: "Systemd, an alternative to Upstart or System V init, has made big strides since it was announced at the end of April. It has been packaged for Fedora and openSUSE, and for users of Fedora Rawhide, it gets installed as the default."

September 04, 2010 11:06 AM

Samsung: Galaxy Tab has leg up on Apple iPad

Cnet: "When Samsung debuted its Galaxy Tab on Thursday, it made a bold claim: the device is at least as good as today's dominant tablet, Apple's iPad."

September 04, 2010 07:06 AM

Webcam server on Linux

Linuxaria: "In these days all the new netbooks and most laptops have built-in webcam, or at least add one external costs very little, apart from the traditional use in video calls you can use our Linux box to create a monitoring device"

September 04, 2010 03:06 AM

September 03, 2010

GigaOM by Om Malik

Pingcap

Apple iTunes’ Ping launched Wednesday night to a flurry of chatter lauding it as a MySpace killer, only to land Thursday morning amid criticism and a nasty break-up with Facebook. Now that the dust is beginning to settle, let’s take a look at what Ping is, as well as what it could have been.

Om claimed that Ping is the future of social commerce, but its sole focus on purchases and its presence behind a walled garden may hinder that bright future. Here are the four main issues Apple has to work on quickly for Ping to be successful:

Ping’s main method of social interaction is based only on new purchases.

In order to populate a social network of any kind, you need to give users the ability to share whatever it is they bring with them. I have over 6000 songs on my laptop alone, and that’s not including what I have stored on my external drive. Probably a third of those were purchased via iTunes. In order to share any of that music with those who are following me, I have to click out of Ping and into the iTunes Store, find an album I already own, and click “Like” on the drop-down menu. It’s a counter-intuitive UI involving too much effort on the part of the user. My music stream — and my actions on certain favorite or hated songs — can already be shared on other services. Why would I bother going through all that when I can click “Love” on last.fm (which is already running in my dock) and share that song through another social network, which may already be providing direct links to Amazon MP3 or another service for purchase?

In addition, Ping is so divorced from the iTunes experience that when I let it auto-populate my “Music I Like” selections at sign-up, soon I had 10 selections of my kids’ music, which they bought with gift cards they’d received as presents. No, Ping, those one-star selections of Hannah Montana and the Chipmunk movie soundtracks are not “music I like.” In fact, when rating that music in iTunes, I think that one-star rating I assigned the kids’ music said I really didn’t like it at all. Buying does not equal liking.

Ping doesn’t allow you to create new tangential conversations, or share additional statuses, locations, or activities with your social graph.

In order to begin any conversation on Ping, I have to do something involving the iTunes store: hunt down a song or album and Like or Post it, or buy something. I can’t begin a conversation with “Hey, did any of you catch that live Arcade Fire show on YouTube? What did you think?” Again, there’s an opportunity here for smart-linking to products based on organic conversations, and Apple is missing the boat. I may think to leave a comment if a purchase or a “like” happens by in my stream, but if I’ve already liked or bought an album and want to bring it up later, I’d have to go digging for the old conversation. I can’t start another one.

The concept may be Apple, but the UI certainly isn’t.

The one point that any Apple fanboy (or fangirl) has always been able to make without argument is how intuitive Apple’s UIs have always been. It’s the original company with a plug-in-and-go M.O. for its products, yet even on Wednesday night, when I joined Ping and talked about it with the early adopter crowd on Twitter, we were all stumbling about. If people who have more than 400 log-in IDs for social networks are confused about how to go about interacting with each other on a social site, how will an average user be able to figure it out? The familiar status box you see at the top of the screen on every social network is missing. If you want to comment on another user’s activity, you have to seek out a small link to pop up a comment box (an existing whitespace would be much more obvious). Worst of all, the drop-down menu that appears on albums or songs in your stream appears to be part of the “Buy Album” button by design, which could make wary users afraid they might purchase the album rather than comment on it.

The social graph is missing.

This is Creating a Social Network 101. No one wants to sign up for a new service, only to manually seek out the same group of people they are friends with on Facebook and follow on Foursquare. They want a quick and simple method of importing contacts from a service like Facebook Connect. The post-launch implosion of an Apple-Facebook deal to piggyback on Facebook’s social graph was a devastating blow for populating a new service. Apple needs to do something quickly to replace it, or those signing up will quickly tire of logging in only to find there’s no new activity in the past three hours. Social networks need a constant stream of activity to keep users engaged.

As for trying to convince my friends to use the service, they don’t want to be bothered, for the most part. They’re already using services on Facebook or MySpace to share music in a much simpler fashion than Ping is providing. In the nearly 48 hours since launch, I’ve assembled a circle of real-world friends and tech connections that, with combined followers and those I follow, is less than 30. Considering I have over 300 friends on Facebook, that’s a pretty small percentage of my social graph.

I listen to my iTunes library during my work day, alternating with Pandora. While I’ll frequently click over to last.fm (if iTunes is running) or Pandora to like or block a song, it’s too hard to do that with Ping because I have to take a break from whatever I’m doing to hunt it down. My other social music sharing is as simple as flipping to another window and clicking a single button. When I do check in to Ping, I have to manually refresh and it’s often hours between updates. Right now, Ping is a lonesome place that seems to be populated only by diehard early adopters and Apple fans.

Related content from GigaOM Pro (sub req’d):

With Ping, Apple Builds a Social Network Inside a Walled Garden


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Cyndy Aleo at September 03, 2010 11:30 PM

Linux Today

Editor's Note: Holiday Tech Fun

What to do on a long weekend? Digikam, Ardour, Drupal, new computers....

September 03, 2010 11:06 PM

GigaOM by Om Malik

Facebook social search1

As expected, Facebook has started integrating social activity from around the web into the search results on its site, by showing how many people “liked” or shared a specific news story or blog post, as shown in the screenshot below (first noticed by All Facebook). The results are powered by the social-graph plugins embedded in hundreds of thousands of websites, which Facebook launched earlier this year at its F8 conference. The new feature is the latest step in rolling out the network’s social-search engine — which could become a competitive threat for Google and other traditional search companies, as more users turn to recommendations from their networks instead of those determined by algorithms.

Related content from GigaOM Pro (sub req’d): Why Google Should Fear the Social Web


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Mathew Ingram at September 03, 2010 10:54 PM

Linux Today

Mint 9: Minty fresh Linux

Cyber Cynic: "I currently use OpenSUSE on my servers, and Ubuntu, Fedora, and MEPIS on my desktops and laptops. Now, I have a new resident on my desktops: Mint 9."

September 03, 2010 10:36 PM

GigaOM by Om Malik

ryangigaom

While on-demand services like YouTube and Hulu get all the press, live streaming services have posted dramatic increases in viewer engagement over the last year. Time spent watching live online video increased nearly 650 percent in that time, to 1.4 billion minutes a month.

Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Ryan Lawler at September 03, 2010 10:10 PM

Linux Today

Amnesia: The Dark Descent Demo Released!

Linux Gaming News: "Today Frictional Games, the makers of the Penumbra series, has released a demo for their upcoming first person survival horror “Amnesia: The Dark Descent”. Players with Windows, Mac or Linux all get a chance to experience a small slice of the dread and terror contained in the full game."

September 03, 2010 10:06 PM

Android: the return of the Unix wars?

LWN.net: "Your editor was recently amused to encounter this ZDNet article on "Android's dirty little secret." According to that article, the openness of Android has led to an increase in the control held by handset manufacturers and wireless carriers and the fragmentation of the platform."

September 03, 2010 09:36 PM

Songbird 1.8.0 adds support for more devices

The H Open: "Nearly three months after the last update, the Songbird developers have released version 1.8.0 of their open source media player, code named "Orbital"."

September 03, 2010 09:06 PM

Asterisk VoIP News

Skype Introduces 10-Way Video Calling

Skype — apparently pleased with its five-way beta group video-calling functionality — has just released a new version of Skype 5.0 for Windows that doubles group support. It now allows for up to 10 video callers.

Skype 5.0 beta two is already available for download; it includes 10-way video calls, automatic call recovery and a cleaner user interface. The update is also said to improve call quality and includes a number of bug fixes to make the overall experience much smoother.

Of course, the standout feature is 10-way video calling, something that certainly one-ups their own previous offering and makes it suitable for even larger virtual team meetings and mini family reunions. Of course, it also makes Gmail’s (Gmail) video-calling functionality look like the ugly step sister — a proactive move on the part of Skype (Skype) to combat recent buzz surrounding Gmail Voice Calling.

Still, Skype does caution that 5.0 is beta, and hence, very buggy. It’s also limited to Windows (Windows) users, and 10-way video calls require all group chatters to be using the same second beta version of the app. Have you tested out five-way video calls? Are you ready to upgrade to the 10-person variety?

Source

by Dal at September 03, 2010 09:01 PM

GigaOM by Om Malik

gawker-social-media-referrals

Updated: As a number of readers have pointed out, I read this chart incorrectly. I read absolute numbers of visitors into it, whereas the chart is actually meant to show percentages of traffic. I’ve tried to correct my mis-impressions below — although the chart does still show that Stumbleupon drives a lot of traffic, so my conclusions are not completely wrong.

In a chart of the traffic it gets from a variety of social networks (posted by All Things Digital), Gawker Media showed that the number one referrer is Reddit — which has been gaining momentum recently, thanks in part to the reaction to Digg’s relaunch, but was number one as far back as last year Facebook. Some observers used the chart to point out how much Digg has fallen off in terms of its influence, while others used it to show how Twitter is not as mainstream as most people think. What struck me, however, was how little attention gets paid to the number two and three of the leading referrers on that graph: namely Fark and Stumbleupon.

According to my reading of the chart, Fark drove more than 18 million visitors to Gawker in August — up from less than 10 million in the same period last year — while Stumbleupon drove about 16 million, about double what it sent the network of websites in the same month a year earlier. Twitter accounted for around 12 million (which was still twice the number of visitors it drove a year ago) and Digg was responsible for just under 10 million of the site’s 20 million total visitors in August. Facebook is still in last place, according to Gawker’s numbers, but it’s been growing sharply.

The chart shows that the largest proportion of Gawker’s 20 million total visitors in August came from Facebook, which has gone from being an also-ran to the overall leader in the past year. The proportion of traffic that comes from Digg has shrunk over that period, but the traffic coming from Stumbleupon has grown substantially, putting it a close second place (judging by the size of the colored bars representing each site). Reddit’s share of traffic has also grown, as has Twitter’s.

Fark, which was founded by Drew Curtis in 1999 as a way to share funny links with his friends, doesn’t get a lot of press attention. In fact, the service is virtually never mentioned in stories about Digg’s redesign or the rivalry between Digg and Reddit, or even in stories about Twitter and its growth as a traffic driver. Yet it and Stumbleupon drive relatively huge numbers of readers to many websites. Fark reportedly has more than four million unique visitors a month. Is the lack of attention because Curtis keeps a low profile (he still lives in his home town of Lexington, Ky.), or because the site isn’t venture-funded and isn’t located in Silicon Valley? It’s anyone’s guess.

Meanwhile, Stumbleupon — which was started by Canadian Garrett Camp and a small group of friends in 2001 as a spinoff from Camp’s graduate-school research project in Calgary, Alberta — gets a little more press because it was backed by VCs and ultimately bought by eBay in 2007 for $75 million. While there was some attention paid to the company after it was bought back by the founders and a small group of venture investors in 2009, the newly independent startup has kept a pretty low profile since. Despite the lack of headlines, however, many publishers (including GigaOM) know that when a link gets “stumbled,” there can often be a huge influx of readers.

Stumbleupon recently announced that it signed up its 10 millionth user, and the company says it has been growing in other ways; according to a blog post, the number of advertisers has climbed by 20 percent, and the company’s headcount has grown by over 50 percent in 2010. The site recently launched iPhone and Android apps as well. A report from Statcounter on social-media traffic shows Facebook is far and away the leader in referrals to sites that use its analytics service, with about 63 percent, and Stumbleupon is in second spot with 16 percent.

Related content from GigaOM Pro (sub req’d): Why Google Should Fear the Social Web

Post and thumbnail photos courtesy of Flickr user Andrew Poynton


Alcatel-Lucent NextGen Communications Spotlight — Learn More »

by Mathew Ingram at September 03, 2010 08:57 PM

Linux Today

ZaReason Terra HD

Ubuntu Linux Tips & Tricks: "Jono wrote about his new ZaReason Strata, and Rich wrote about his new ZaReason something-else, so I figured I'd let you all know about the ZaReason Terra HD I said I wanted to order, now that I've had it about a month."

September 03, 2010 08:36 PM