[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [kDev] kendraTools: information structure...



Daniel Harris wrote:

Hi Neil and All,

Thanks. I took a look at the Wikipedia page...

At this stage of kendraTools I see no reason to define a strict language and probably never to do so. Each application will make it's own definitions; have their own input restrictions and search criteria. Have a look at the address application (?) that I've just coded for kendraTools:

http://www.kendra.org.uk/tools1/addresses.php

It's very simple (and matches my coding ability) but I hope it shows that we can data mine this mass of relationships any way we like.

Also, take a look at some other ideas for links that other applications may include at some point:

http://moof.org.uk/forum/index.cgi/DevelopmentPlanBook1Links

So, I suggest that we concentrate on the applications and make sure that the "information store" can hold as many different types of information, constructs and relationships as possible and yet still have a simple structure.

Does this make sense?

On the is/was are/were constructs: those are to do with time and perhaps it's better to have an additional relationship to these object stating when this relationship took place...

Cheers Daniel

Dear all,

Here are is a distillation of some recent thoughts, based on discussions yesterday.

--------------------------------------------------------------------------------
The software

0 The system is not designed to solve a single Kendra project: it is a framework for making it trivial to generate solutions for Kendra projects in general.
0a The goal can be described as "bootstrapping the semantic web"
0b More concretely, this means hosting collaboration fora for describing things and facilitating exchanges based on these descriptions, from many points of view, keeping the diversity of views that is central to the Kendra concept

1 The data store stores _everything_ under a name which is a Unicode string (the "Universe of discourse") 1a [there will also be simple namespaces to deal with special / new data types]

2 these things can be (initially)
* structured data objects with fields made from simple types (including names of other objects)
* relational statements of the form "x relation y"
* English or other language plain text comments
* page/report templates -- definition by example, etc.
2a Some operations are restricted to admins for the time being.

3 The system will initially support very simple syntax for declaring templates for new kinds of object, and creating and editing objects.
4 Every user has an account.
5 All assertions are tagged by which user asserted them.
6 The system is designed to emit W3C-compliant Semantic Web pages, as and when that is well-defined. (Just like a Wiki serves standard HTML, as well as supporting its own internal format.) These pages can be exported as standard Web pages that will appear to be part of the static public Web. 6a RDF export will probably suffice initially.

7 Nice web design will need to be added for credibility among general user community. (So we need some nice artwork etc, but in the short term good clean HTML 4.01 + CSS will do for development).

8 Full source + database dumps to be available on servers, so mirrors can be set up by anyone.

9 Need to be able to import data from public domain sources (NIMA database etc), peer with copyleft content (MusicBrainz etc.), and allow copyrighted data owners to participate, without giving up the rights on their data...

Reference points:
* Wikipedia (easy editing, simple namespace, 70,000+ encyclopedia articles)
* Mindpixel (currently 1,000,000 very simple assertions)
* Open Mind Initiative (doesn't say)
* Commonforum (based on MoinMoin wiki)
* MusicBrainz, IMDB...

See also: http://www.wikipedia.org/wiki/Wikipedia:Size_comparisons

-----------------------------------------------------------------------------------
The server cloud

A migration path for the software...

1  single server, running a single copy of the server
2 multiple cooperating servers, running on multiple boxes, run by Kendra, as proof of concept 3 as 2, but with servers run by other trusted organizations, with a central Kendra "mothership" 4 allow anyone to act as a content peer? (requires self-governing community critical mass to manage potential problems)
5  the "mothership" becomes unneccessary.

-----------------------------------------------------------------------------------


The data

1  different licences may be needed per project
2  what dump format to support?