book

Learning Perl, 7th Edition

by Randal L. Schwartz, brian d foy, Tom Phoenix

October 2016

Beginner

391 pages

10h 38m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Typographical ConventionsCode ExamplesO’Reilly SafariHow to Contact UsHistory of This BookChanges from the Previous EditionAcknowledgmentsFrom RandalFrom brianFrom TomFrom All of Us
Questions and AnswersIs This the Right Book for You?What About the Exercises and Their Answers?What If I’m a Perl Course Instructor?What Does “Perl” Stand For?Why Did Larry Create Perl?Why Didn’t Larry Just Use Some Other Language?Is Perl Easy or Hard?How Did Perl Get to Be So Popular?What’s Happening with Perl Now?What’s Perl Really Good For?What Is Perl Not Good For?How Can I Get Perl?What Is CPAN?Is There Any Kind of Support?What If I Find a Bug in Perl?How Do I Make a Perl Program?A Simple ProgramWhat’s Inside That Program?How Do I Compile My Perl Program?A Whirlwind Tour of PerlExercises
NumbersAll Numbers Have the Same Format InternallyInteger LiteralsNondecimal Integer LiteralsFloating-Point LiteralsNumeric OperatorsStringsSingle-Quoted String LiteralsDouble-Quoted String LiteralsString OperatorsAutomatic Conversion Between Numbers and StringsPerl’s Built-In WarningsInterpreting Nondecimal NumeralsScalar VariablesChoosing Good Variable NamesScalar AssignmentCompound Assignment OperatorsOutput with printInterpolation of Scalar Variables into StringsCreating Characters by Code PointOperator Precedence and AssociativityComparison OperatorsThe if Control StructureBoolean ValuesGetting User InputThe chomp OperatorThe while Control StructureThe undef ValueThe defined FunctionExercises
Accessing Elements of an ArraySpecial Array IndicesList LiteralsThe qw ShortcutList AssignmentThe pop and push OperatorsThe shift and unshift OperatorsThe splice OperatorInterpolating Arrays into StringsThe foreach Control StructurePerl’s Favorite Default: $_The reverse OperatorThe sort OperatorThe each OperatorScalar and List ContextUsing List-Producing Expressions in Scalar ContextUsing Scalar-Producing Expressions in List ContextForcing Scalar Context<STDIN> in List ContextExercises
Defining a SubroutineInvoking a SubroutineReturn ValuesArgumentsPrivate Variables in SubroutinesVariable-Length Parameter ListsA Better &max RoutineEmpty Parameter ListsNotes on Lexical (my) VariablesThe use strict PragmaThe return OperatorOmitting the AmpersandNonscalar Return ValuesPersistent, Private VariablesSubroutine SignaturesExercises
Input from Standard InputInput from the Diamond OperatorThe Double DiamondThe Invocation ArgumentsOutput to Standard OutputFormatted Output with printfArrays and printfFilehandlesOpening a FilehandleBinmoding FilehandlesBad FilehandlesClosing a FilehandleFatal Errors with dieWarning Messages with warnAutomatically die-ingUsing FilehandlesChanging the Default Output FilehandleReopening a Standard FilehandleOutput with sayFilehandles in a ScalarExercises
What Is a Hash?Why Use a Hash?Hash Element AccessThe Hash as a WholeHash AssignmentThe Big ArrowHash FunctionsThe keys and values FunctionsThe each FunctionTypical Use of a HashThe exists FunctionThe delete FunctionHash Element InterpolationThe %ENV hashExercises
SequencesPractice Some PatternsThe WildcardQuantifiersGrouping in PatternsAlternationCharacter ClassesCharacter Class ShortcutsNegating the ShortcutsUnicode PropertiesAnchorsWord AnchorsExercises
Matches with m//Match ModifiersCase-Insensitive Matching with /iMatching Any Character with /sAdding Whitespace with /xCombining Option ModifiersChoosing a Character InterpretationBeginning and End-of-Line AnchorsOther OptionsThe Binding Operator =~The Match VariablesThe Persistence of CapturesNoncapturing ParenthesesNamed CapturesThe Automatic Match VariablesPrecedenceExamples of PrecedenceAnd There’s MoreA Pattern Test ProgramExercises
Substitutions with s///Global Replacements with /gDifferent DelimitersSubstitution ModifiersThe Binding OperatorNondestructive SubstitutionsCase ShiftingMetaquotingThe split OperatorThe join Functionm// in List ContextMore Powerful Regular ExpressionsNongreedy QuantifiersFancier Word BoundariesMatching Multiple-Line TextUpdating Many FilesIn-Place Editing from the Command LineExercises

The unless Control StructureThe else Clause with unlessThe until Control StructureStatement ModifiersThe Naked Block Control StructureThe elsif ClauseAutoincrement and AutodecrementThe Value of AutoincrementThe for Control StructureThe Secret Connection Between foreach and forLoop ControlsThe last OperatorThe next OperatorThe redo OperatorLabeled BlocksThe Conditional OperatorLogical OperatorsThe Value of a Short-Circuit OperatorThe defined-or OperatorControl Structures Using Partial-Evaluation OperatorsExercises
Finding ModulesInstalling ModulesUsing Your Own DirectoriesUsing Simple ModulesThe File::Basename ModuleUsing Only Some Functions from a ModuleThe File::Spec ModulePath::ClassDatabases and DBIDates and TimesExercises
File Test OperatorsTesting Several Attributes of the Same FileStacked File Test OperatorsThe stat and lstat FunctionsThe localtime FunctionBitwise OperatorsUsing BitstringsExercises
The Current Working DirectoryChanging the DirectoryGlobbingAn Alternate Syntax for GlobbingDirectory HandlesManipulating Files and DirectoriesRemoving FilesRenaming FilesLinks and FilesMaking and Removing DirectoriesModifying PermissionsChanging OwnershipChanging TimestampsExercises
Finding a Substring with indexManipulating a Substring with substrFormatting Data with sprintfUsing sprintf with “Money Numbers”Advanced SortingSorting a Hash by ValueSorting by Multiple KeysExercises
The system FunctionAvoiding the ShellThe Environment VariablesThe exec FunctionUsing Backquotes to Capture OutputUsing Backquotes in a List ContextExternal Processes with IPC::System::SimpleProcesses as FilehandlesGetting Down and Dirty with ForkSending and Receiving SignalsExercises
SlicesArray SliceHash SliceKey-Value SlicesTrapping ErrorsUsing evalMore Advanced Error HandlingPicking Items from a List with grepTransforming Items from a List with mapFancier List UtilitiesExercises
Answers to Chapter 1 ExercisesAnswers to Chapter 2 ExercisesAnswers to Chapter 3 ExercisesAnswers to Chapter 4 ExercisesAnswers to Chapter 5 ExercisesAnswers to Chapter 6 ExercisesAnswers to Chapter 7 ExercisesAnswers to Chapter 8 ExercisesAnswers to Chapter 9 ExercisesAnswers to Chapter 10 ExercisesAnswers to Chapter 11 ExercisesAnswers to Chapter 12 ExercisesAnswers to Chapter 13 ExercisesAnswers to Chapter 14 ExercisesAnswers to Chapter 15 ExercisesAnswers to Chapter 16 Exercises
Further DocumentationRegular ExpressionsPackagesExtending Perl’s FunctionalityWriting Your Own ModulesDatabasesMathematicsLists and ArraysBits and PiecesFormatsNetworking and IPCSystem V IPCSocketsSecurityDebuggingCommand-Line OptionsBuilt-In VariablesReferencesComplex Data StructuresObject-Oriented ProgrammingAnonymous Subroutines and ClosuresTied VariablesOperator OverloadingUsing Other Languages Inside PerlEmbeddingConverting find Command Lines to PerlCommand-Line Options in Your ProgramsEmbedded DocumentationMore Ways to Open FilehandlesGraphical User Interfaces (GUIs)And More…
UnicodeUTF‑8 and FriendsGetting Everyone to AgreeFancy CharactersUsing Unicode in Your SourceFancier CharactersDealing with Unicode in PerlFancier Characters by NameReading from STDIN or Writing to STDOUT or STDERRReading from and Writing to FilesDealing with Command-Line ArgumentsDealing with DatabasesFurther Reading
A Short History of Perl DevelopmentPerl 5.10 and BeyondInstalling a Recent PerlExperimental FeaturesTurning Off Experimental WarningsEnable or Disable Features LexicallyDon’t Rely on Experimental Features

Content preview from Learning Perl, 7th Edition

Appendix C. A Unicode Primer

This isn’t a complete or comprehensive introduction to Unicode; it’s just enough for you to understand the parts of Unicode that we present in Learning Perl. Unicode is tricky not only because it’s a new way to think about strings, with lots of adjusted vocabulary, but also because computer languages in general have implemented it so poorly. Perl 5.14 makes lots of improvements to Perl’s Unicode compliance, but it’s not perfect (yet). Each version since then has brought Perl closer to full compliance. It is, arguably, the best Unicode support that you will find, though.

Unicode

The Universal Character Set (UCS) is an abstract mapping of characters to code points. It has nothing to do with a particular representation in memory, which means we can agree on at least one way to talk about characters no matter which platform we’re on. An encoding turns the code points into a particular representation in memory, taking the abstract mapping and representing it physically within a computer. You probably think of this storage in terms of bytes, although when talking about Unicode, we use the term octets (see Figure C-1). Different encodings store the characters differently. To go the other way, interpreting the octets as characters, you decode them. You don’t have to worry too much about these because Perl can handle most of the details for you.