PdfExtractor.ExtractText
ExtractText()
Extracts the text from a PDF document using Unicode encoding.
public void ExtractText()
Examples
The first example demonstrates how to extract all the text from a PDF file.
PdfExtractor extractor = new PdfExtractor();
extractor.BindPdf(@"D:\Text\text.pdf");
extractor.ExtractText();
extractor.GetText(@"D:\Text\text.txt");
Dim extractor As PdfExtractor = New PdfExtractor()
extractor.BindPdf("D:\Text\text.pdf")
extractor.ExtractText()
extractor.GetText("D:\Text\text.txt")
The second example demonstrates how to extract the text of each page into its own txt file.
PdfExtractor extractor = new PdfExtractor();
extractor.BindPdf(TestPath + @"Aspose.Pdf.Kit.Pdf");
extractor.ExtractText();
String prefix = TestPath + @"Aspose.Pdf.Kit";
String suffix = ".txt";
int pageCount = 1;
while (extractor.HasNextPageText())
{
    extractor.GetNextPageText(prefix + pageCount + suffix);
    pageCount++;
}
Dim extractor As PdfExtractor = New PdfExtractor()
extractor.BindPdf(TestPath + "Aspose.Pdf.Kit.Pdf")
extractor.ExtractText()
Dim prefix As String = TestPath + "Aspose.Pdf.Kit"
Dim suffix As String = ".txt"
Dim pageCount As Integer = 1
While extractor.HasNextPageText()
    ' Use & here: in Visual Basic, String + Integer is not string concatenation.
    extractor.GetNextPageText(prefix & pageCount & suffix)
    pageCount = pageCount + 1
End While
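This overload extracts text using Unicode encoding, which in .NET terms is `Encoding.Unicode`, that is, UTF-16 little-endian. The sketch below uses only the .NET base class library (no Aspose calls) to illustrate why a consumer of the extracted bytes should decode them with the same encoding. The class name and sample string are illustrative and not part of the API.

```csharp
using System;
using System.Text;

class EncodingRoundTrip
{
    static void Main()
    {
        // Six characters; the escapes avoid any dependence on source-file encoding.
        string sample = "R\u00e9sum\u00e9";

        // Encoding.Unicode is UTF-16 little-endian: two bytes per character here.
        byte[] utf16 = Encoding.Unicode.GetBytes(sample);
        Console.WriteLine(utf16.Length); // 12

        // Decoding with the same encoding restores the original string.
        Console.WriteLine(Encoding.Unicode.GetString(utf16) == sample); // True

        // Decoding the same bytes as UTF-8 does not.
        Console.WriteLine(Encoding.UTF8.GetString(utf16) == sample); // False
    }
}
```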
See also
- class PdfExtractor
- namespace Aspose.Pdf.Facades
- assembly Aspose.PDF
ExtractText(Encoding)
Extracts the text from a PDF document using the specified encoding.
public void ExtractText(Encoding encoding)
| Parameter | Type | Description |
|---|---|---|
| encoding | Encoding | The encoding of the extracted text. |
Examples
The first example demonstrates how to extract all the text from a PDF file.
PdfExtractor extractor = new PdfExtractor();
extractor.BindPdf(@"D:\Text\text.pdf");
extractor.ExtractText(Encoding.Unicode);
extractor.GetText(@"D:\Text\text.txt");
Dim extractor As PdfExtractor = New PdfExtractor()
extractor.BindPdf("D:\Text\text.pdf")
extractor.ExtractText(Encoding.Unicode)
extractor.GetText("D:\Text\text.txt")
The second example demonstrates how to extract the text of each page into its own txt file.
PdfExtractor extractor = new PdfExtractor();
extractor.BindPdf(TestPath + @"Aspose.Pdf.Kit.Pdf");
extractor.ExtractText(Encoding.Unicode);
String prefix = TestPath + @"Aspose.Pdf.Kit";
String suffix = ".txt";
int pageCount = 1;
while (extractor.HasNextPageText())
{
    extractor.GetNextPageText(prefix + pageCount + suffix);
    pageCount++;
}
Dim extractor As PdfExtractor = New PdfExtractor()
extractor.BindPdf(TestPath + "Aspose.Pdf.Kit.Pdf")
extractor.ExtractText(Encoding.Unicode)
Dim prefix As String = TestPath + "Aspose.Pdf.Kit"
Dim suffix As String = ".txt"
Dim pageCount As Integer = 1
While extractor.HasNextPageText()
    ' Use & here: in Visual Basic, String + Integer is not string concatenation.
    extractor.GetNextPageText(prefix & pageCount & suffix)
    pageCount = pageCount + 1
End While
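When the extracted text is needed in memory rather than in a file, one possible approach is to write it to a stream and decode the bytes with the same encoding that was passed to ExtractText. The following is a minimal sketch, assuming the `GetText(Stream)` overload of PdfExtractor and reusing the sample input path from the first example. The class name `ExtractToMemory` is illustrative.

```csharp
using System;
using System.IO;
using System.Text;
using Aspose.Pdf.Facades;

class ExtractToMemory
{
    static void Main()
    {
        // Same sample path as in the first example; adjust to a real file.
        PdfExtractor extractor = new PdfExtractor();
        extractor.BindPdf(@"D:\Text\text.pdf");

        // Extract with an explicit encoding.
        extractor.ExtractText(Encoding.Unicode);

        using (MemoryStream buffer = new MemoryStream())
        {
            // Assumption: GetText(Stream) writes the extracted text to the stream.
            extractor.GetText(buffer);

            // Decode with the encoding that was used for extraction.
            string text = Encoding.Unicode.GetString(buffer.ToArray());
            Console.WriteLine(text);
        }
    }
}
```

Decoding with a different encoding than the one given to ExtractText would likely produce garbled output, so it can help to keep the encoding in a single variable and use it for both calls.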
See also
- class PdfExtractor
- namespace Aspose.Pdf.Facades
- assembly Aspose.PDF