1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
|
NAME
YAML - YAML Ain't Markup Language™
VERSION
This document describes YAML version 1.15.
NOTE
This module has been released to CPAN as YAML::Old, and soon YAML.pm
will be changed to just be a frontend interface module for all the
various Perl YAML implementation modules, including YAML::Old.
If you want robust and fast YAML processing using the normal Dump/Load
API, please consider switching to YAML::XS. It is by far the best Perl
module for YAML at this time. It requires that you have a C compiler,
since it is written in C.
If you really need to use this version of YAML.pm it will always be
available as YAML::Old.
The rest of this documentation is left unchanged, until YAML.pm is
switched over to the new UI-only version.
SYNOPSIS
use YAML;
# Load a YAML stream of 3 YAML documents into Perl data structures.
my ($hashref, $arrayref, $string) = Load(<<'...');
---
name: ingy
age: old
weight: heavy
# I should comment that I also like pink, but don't tell anybody.
favorite colors:
- red
- green
- blue
---
- Clark Evans
- Oren Ben-Kiki
- Ingy döt Net
--- >
You probably think YAML stands for "Yet Another Markup Language". It
ain't! YAML is really a data serialization language. But if you want
to think of it as a markup, that's OK with me. A lot of people try
to use XML as a serialization format.
"YAML" is catchy and fun to say. Try it. "YAML, YAML, YAML!!!"
...
# Dump the Perl data structures back into YAML.
print Dump($string, $arrayref, $hashref);
# YAML::Dump is used the same way you'd use Data::Dumper::Dumper
use Data::Dumper;
print Dumper($string, $arrayref, $hashref);
DESCRIPTION
The YAML.pm module implements a YAML Loader and Dumper based on the
YAML 1.0 specification. http://www.yaml.org/spec/
YAML is a generic data serialization language that is optimized for
human readability. It can be used to express the data structures of
most modern programming languages. (Including Perl!!!)
For information on the YAML syntax, please refer to the YAML
specification.
WHY YAML IS COOL
YAML is readable for people.
It makes clear sense out of complex data structures. You should find
that YAML is an exceptional data dumping tool. Structure is shown
through indentation, YAML supports recursive data, and hash keys are
sorted by default. In addition, YAML supports several styles of
scalar formatting for different types of data.
YAML is editable.
YAML was designed from the ground up to be an excellent syntax for
configuration files. Almost all programs need configuration files, so
why invent a new syntax for each one? And why subject users to the
complexities of XML or native Perl code?
YAML is multilingual.
Yes, YAML supports Unicode. But I'm actually referring to programming
languages. YAML was designed to meet the serialization needs of Perl,
Python, Ruby, Tcl, PHP, Javascript and Java. It was also designed to
be interoperable between those languages. That means YAML
serializations produced by Perl can be processed by Python.
YAML is taint safe.
Using modules like Data::Dumper for serialization is fine as long as
you can be sure that nobody can tamper with your data files or
transmissions. That's because you need to use Perl's eval() built-in
to deserialize the data. Somebody could add a snippet of Perl to
erase your files.
YAML's parser does not need to eval anything.
YAML is full featured.
YAML can accurately serialize all of the common Perl data structures
and deserialize them again without losing data relationships.
Although it is not 100% perfect (no serializer is or can be perfect),
it fares as well as the popular current modules: Data::Dumper,
Storable, XML::Dumper and Data::Denter.
YAML.pm also has the ability to handle code (subroutine) references
and typeglobs. (Still experimental) These features are not found in
Perl's other serialization modules.
YAML is extensible.
The YAML language has been designed to be flexible enough to solve
it's own problems. The markup itself has 3 basic construct which
resemble Perl's hash, array and scalar. By default, these map to
their Perl equivalents. But each YAML node also supports a tagging
mechanism (type system) which can cause that node to be interpreted
in a completely different manner. That's how YAML can support object
serialization and oddball structures like Perl's typeglob.
YAML IMPLEMENTATIONS IN PERL
This module, YAML.pm, is really just the interface module for YAML
modules written in Perl. The basic interface for YAML consists of two
functions: Dump and Load. The real work is done by the modules
YAML::Dumper and YAML::Loader.
Different YAML module distributions can be created by subclassing
YAML.pm and YAML::Loader and YAML::Dumper. For example, YAML-Simple
consists of YAML::Simple YAML::Dumper::Simple and YAML::Loader::Simple.
Why would there be more than one implementation of YAML? Well, despite
YAML's offering of being a simple data format, YAML is actually very
deep and complex. Implementing the entirety of the YAML specification
is a daunting task.
For this reason I am currently working on 3 different YAML
implementations.
YAML
The main YAML distribution will keeping evolving to support the
entire YAML specification in pure Perl. This may not be the fastest
or most stable module though. Currently, YAML.pm has lots of known
bugs. It is mostly a great tool for dumping Perl data structures to a
readable form.
YAML::Tiny
The point of YAML::Tiny is to strip YAML down to the 90% that people
use most and offer that in a small, fast, stable, pure Perl form.
YAML::Tiny will simply die when it is asked to do something it can't.
YAML::Syck
libsyck is the C based YAML processing library used by the Ruby
programming language (and also Python, PHP and Pugs). YAML::Syck is
the Perl binding to libsyck. It should be very fast, but may have
problems of its own. It will also require C compilation.
NOTE: Audrey Tang has actually completed this module and it works
great and is 10 times faster than YAML.pm.
In the future, there will likely be even more YAML modules. Remember,
people other than Ingy are allowed to write YAML modules!
FUNCTIONAL USAGE
YAML is completely OO under the hood. Still it exports a few useful top
level functions so that it is dead simple to use. These functions just
do the OO stuff for you. If you want direct access to the OO API see
the documentation for YAML::Dumper and YAML::Loader.
Exported Functions
The following functions are exported by YAML.pm by default. The reason
they are exported is so that YAML works much like Data::Dumper. If you
don't want functions to be imported, just use YAML with an empty import
list:
use YAML ();
Dump(list-of-Perl-data-structures)
Turn Perl data into YAML. This function works very much like
Data::Dumper::Dumper(). It takes a list of Perl data structures and
dumps them into a serialized form. It returns a string containing the
YAML stream. The structures can be references or plain scalars.
Load(string-containing-a-YAML-stream)
Turn YAML into Perl data. This is the opposite of Dump. Just like
Storable's thaw() function or the eval() function in relation to
Data::Dumper. It parses a string containing a valid YAML stream into
a list of Perl data structures.
Exportable Functions
These functions are not exported by default but you can request them in
an import list like this:
use YAML qw'freeze thaw Bless';
freeze() and thaw()
Aliases to Dump() and Load() for Storable fans. This will also allow
YAML.pm to be plugged directly into modules like POE.pm, that use the
freeze/thaw API for internal serialization.
DumpFile(filepath, list)
Writes the YAML stream to a file instead of just returning a string.
LoadFile(filepath)
Reads the YAML stream from a file instead of a string.
Bless(perl-node, [yaml-node | class-name])
Associate a normal Perl node, with a yaml node. A yaml node is an
object tied to the YAML::Node class. The second argument is either a
yaml node that you've already created or a class (package) name that
supports a yaml_dump() function. A yaml_dump() function should take a
perl node and return a yaml node. If no second argument is provided,
Bless will create a yaml node. This node is not returned, but can be
retrieved with the Blessed() function.
Here's an example of how to use Bless. Say you have a hash containing
three keys, but you only want to dump two of them. Furthermore the
keys must be dumped in a certain order. Here's how you do that:
use YAML qw(Dump Bless);
$hash = {apple => 'good', banana => 'bad', cauliflower => 'ugly'};
print Dump $hash;
Bless($hash)->keys(['banana', 'apple']);
print Dump $hash;
produces:
---
apple: good
banana: bad
cauliflower: ugly
---
banana: bad
apple: good
Bless returns the tied part of a yaml-node, so that you can call the
YAML::Node methods. This is the same thing that YAML::Node::ynode()
returns. So another way to do the above example is:
use YAML qw(Dump Bless);
use YAML::Node;
$hash = {apple => 'good', banana => 'bad', cauliflower => 'ugly'};
print Dump $hash;
Bless($hash);
$ynode = ynode(Blessed($hash));
$ynode->keys(['banana', 'apple']);
print Dump $hash;
Note that Blessing a Perl data structure does not change it anyway.
The extra information is stored separately and looked up by the
Blessed node's memory address.
Blessed(perl-node)
Returns the yaml node that a particular perl node is associated with
(see above). Returns undef if the node is not (YAML) Blessed.
GLOBAL OPTIONS
YAML options are set using a group of global variables in the YAML
namespace. This is similar to how Data::Dumper works.
For example, to change the indentation width, do something like:
local $YAML::Indent = 3;
The current options are:
DumperClass
You can override which module/class YAML uses for Dumping data.
LoaderClass
You can override which module/class YAML uses for Loading data.
Indent
This is the number of space characters to use for each indentation
level when doing a Dump(). The default is 2.
By the way, YAML can use any number of characters for indentation at
any level. So if you are editing YAML by hand feel free to do it
anyway that looks pleasing to you; just be consistent for a given
level.
SortKeys
Default is 1. (true)
Tells YAML.pm whether or not to sort hash keys when storing a
document.
YAML::Node objects can have their own sort order, which is usually
what you want. To override the YAML::Node order and sort the keys
anyway, set SortKeys to 2.
Stringify
Default is 0. (false)
Objects with string overloading should honor the overloading and dump
the stringification of themselves, rather than the actual object's
guts.
UseHeader
Default is 1. (true)
This tells YAML.pm whether to use a separator string for a Dump
operation. This only applies to the first document in a stream.
Subsequent documents must have a YAML header by definition.
UseVersion
Default is 0. (false)
Tells YAML.pm whether to include the YAML version on the
separator/header.
--- %YAML:1.0
AnchorPrefix
Default is ''.
Anchor names are normally numeric. YAML.pm simply starts with '1' and
increases by one for each new anchor. This option allows you to
specify a string to be prepended to each anchor number.
UseCode
Setting the UseCode option is a shortcut to set both the DumpCode and
LoadCode options at once. Setting UseCode to '1' tells YAML.pm to
dump Perl code references as Perl (using B::Deparse) and to load them
back into memory using eval(). The reason this has to be an option is
that using eval() to parse untrusted code is, well, untrustworthy.
DumpCode
Determines if and how YAML.pm should serialize Perl code references.
By default YAML.pm will dump code references as dummy placeholders
(much like Data::Dumper). If DumpCode is set to '1' or 'deparse',
code references will be dumped as actual Perl code.
DumpCode can also be set to a subroutine reference so that you can
write your own serializing routine. YAML.pm passes you the code ref.
You pass back the serialization (as a string) and a format indicator.
The format indicator is a simple string like: 'deparse' or
'bytecode'.
LoadCode
LoadCode is the opposite of DumpCode. It tells YAML if and how to
deserialize code references. When set to '1' or 'deparse' it will use
eval(). Since this is potentially risky, only use this option if you
know where your YAML has been.
LoadCode can also be set to a subroutine reference so that you can
write your own deserializing routine. YAML.pm passes the
serialization (as a string) and a format indicator. You pass back the
code reference.
UseBlock
YAML.pm uses heuristics to guess which scalar style is best for a
given node. Sometimes you'll want all multiline scalars to use the
'block' style. If so, set this option to 1.
NOTE: YAML's block style is akin to Perl's here-document.
UseFold
If you want to force YAML to use the 'folded' style for all multiline
scalars, then set $UseFold to 1.
NOTE: YAML's folded style is akin to the way HTML folds text, except
smarter.
UseAliases
YAML has an alias mechanism such that any given structure in memory
gets serialized once. Any other references to that structure are
serialized only as alias markers. This is how YAML can serialize
duplicate and recursive structures.
Sometimes, when you KNOW that your data is nonrecursive in nature,
you may want to serialize such that every node is expressed in full.
(ie as a copy of the original). Setting $YAML::UseAliases to 0 will
allow you to do this. This also may result in faster processing
because the lookup overhead is by bypassed.
THIS OPTION CAN BE DANGEROUS. If your data is recursive, this option
will cause Dump() to run in an endless loop, chewing up your
computers memory. You have been warned.
CompressSeries
Default is 1.
Compresses the formatting of arrays of hashes:
-
foo: bar
-
bar: foo
becomes:
- foo: bar
- bar: foo
Since this output is usually more desirable, this option is turned on
by default.
QuoteNumericStrings
Default is 0. (false)
Adds detection mechanisms to encode strings that resemble numbers
with mandatory quoting.
This ensures leading that things like leading/trailing zeros and
other formatting are preserved.
YAML TERMINOLOGY
YAML is a full featured data serialization language, and thus has its
own terminology.
It is important to remember that although YAML is heavily influenced by
Perl and Python, it is a language in its own right, not merely just a
representation of Perl structures.
YAML has three constructs that are conspicuously similar to Perl's
hash, array, and scalar. They are called mapping, sequence, and string
respectively. By default, they do what you would expect. But each
instance may have an explicit or implicit tag (type) that makes it
behave differently. In this manner, YAML can be extended to represent
Perl's Glob or Python's tuple, or Ruby's Bigint.
stream
A YAML stream is the full sequence of Unicode characters that a YAML
parser would read or a YAML emitter would write. A stream may contain
one or more YAML documents separated by YAML headers.
---
a: mapping
foo: bar
---
- a
- sequence
document
A YAML document is an independent data structure representation
within a stream. It is a top level node. Each document in a YAML
stream must begin with a YAML header line. Actually the header is
optional on the first document.
---
This: top level mapping
is:
- a
- YAML
- document
header
A YAML header is a line that begins a YAML document. It consists of
three dashes, possibly followed by more info. Another purpose of the
header line is that it serves as a place to put top level tag and
anchor information.
--- !recursive-sequence &001
- * 001
- * 001
node
A YAML node is the representation of a particular data structure.
Nodes may contain other nodes. (In Perl terms, nodes are like
scalars. Strings, arrayrefs and hashrefs. But this refers to the
serialized format, not the in- memory structure.)
tag
This is similar to a type. It indicates how a particular YAML node
serialization should be transferred into or out of memory. For
instance a Foo::Bar object would use the tag 'perl/Foo::Bar':
- !perl/Foo::Bar
foo: 42
bar: stool
collection
A collection is the generic term for a YAML data grouping. YAML has
two types of collections: mappings and sequences. (Similar to hashes
and arrays)
mapping
A mapping is a YAML collection defined by unordered key/value pairs
with unique keys. By default YAML mappings are loaded into Perl
hashes.
a mapping:
foo: bar
two: times two is 4
sequence
A sequence is a YAML collection defined by an ordered list of
elements. By default YAML sequences are loaded into Perl arrays.
a sequence:
- one bourbon
- one scotch
- one beer
scalar
A scalar is a YAML node that is a single value. By default YAML
scalars are loaded into Perl scalars.
a scalar key: a scalar value
YAML has many styles for representing scalars. This is important
because varying data will have varying formatting requirements to
retain the optimum human readability.
plain scalar
A plain scalar is unquoted. All plain scalars are automatic
candidates for "implicit tagging". This means that their tag may be
determined automatically by examination. The typical uses for this
are plain alpha strings, integers, real numbers, dates, times and
currency.
- a plain string
- -42
- 3.1415
- 12:34
- 123 this is an error
single quoted scalar
This is similar to Perl's use of single quotes. It means no escaping
except for single quotes which are escaped by using two adjacent
single quotes.
- 'When I say ''\n'' I mean "backslash en"'
double quoted scalar
This is similar to Perl's use of double quotes. Character escaping
can be used.
- "This scalar\nhas two lines, and a bell -->\a"
folded scalar
This is a multiline scalar which begins on the next line. It is
indicated by a single right angle bracket. It is unescaped like the
single quoted scalar. Line folding is also performed.
- >
This is a multiline scalar which begins on
the next line. It is indicated by a single
carat. It is unescaped like the single
quoted scalar. Line folding is also
performed.
block scalar
This final multiline form is akin to Perl's here-document except that
(as in all YAML data) scope is indicated by indentation. Therefore,
no ending marker is required. The data is verbatim. No line folding.
- |
QTY DESC PRICE TOTAL
--- ---- ----- -----
1 Foo Fighters $19.95 $19.95
2 Bar Belles $29.95 $59.90
parser
A YAML processor has four stages: parse, load, dump, emit.
A parser parses a YAML stream. YAML.pm's Load() function contains a
parser.
loader
The other half of the Load() function is a loader. This takes the
information from the parser and loads it into a Perl data structure.
dumper
The Dump() function consists of a dumper and an emitter. The dumper
walks through each Perl data structure and gives info to the emitter.
emitter
The emitter takes info from the dumper and turns it into a YAML
stream.
NOTE: In YAML.pm the parserloader and the dumperemitter code are
currently very closely tied together. In the future they may be
broken into separate stages.
For more information please refer to the immensely helpful YAML
specification available at http://www.yaml.org/spec/.
YSH - THE YAML SHELL
The YAML distribution ships with a script called 'ysh', the YAML shell.
ysh provides a simple, interactive way to play with YAML. If you type
in Perl code, it displays the result in YAML. If you type in YAML it
turns it into Perl code.
To run ysh, (assuming you installed it along with YAML.pm) simply type:
ysh [options]
Please read the ysh documentation for the full details. There are lots
of options.
BUGS & DEFICIENCIES
If you find a bug in YAML, please try to recreate it in the YAML Shell
with logging turned on ('ysh -L'). When you have successfully
reproduced the bug, please mail the LOG file to the author
(ingy@cpan.org).
WARNING: This is still ALPHA code. Well, most of this code has been
around for years...
BIGGER WARNING: YAML.pm has been slow in the making, but I am committed
to having top notch YAML tools in the Perl world. The YAML team is
close to finalizing the YAML 1.1 spec. This version of YAML.pm is based
off of a very old pre 1.0 spec. In actuality there isn't a ton of
difference, and this YAML.pm is still fairly useful. Things will get
much better in the future.
RESOURCES
http://lists.sourceforge.net/lists/listinfo/yaml-core is the mailing
list. This is where the language is discussed and designed.
http://www.yaml.org is the official YAML website.
http://www.yaml.org/spec/ is the YAML 1.2 specification.
http://yaml.kwiki.org is the official YAML wiki.
SEE ALSO
* YAML::XS
AUTHOR
Ingy döt Net <ingy@cpan.org>
COPYRIGHT AND LICENSE
Copyright 2001-2015. Ingy döt Net.
This program is free software; you can redistribute it and/or modify it
under the same terms as Perl itself.
See http://www.perl.com/perl/misc/Artistic.html
|