This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
ddzz)
e)
U(z).
(6.48)
Upon differentiation with respect to z and cross-substitution, these equations become d2U(z) _ d (lnic m dU(z) 2 2 o (S — n Z) U (z) + k d z dz dz2 )
(6.49)
viiTe general solution is a superposition of two waves: X(x) = cx,e`y`0x +cg2 e —i ch0x
Under the current consideration of one plane wave, only the first term is retained.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Stratified media
119
and Zk2
d2 v ( z ) — d[ln { z (; 1m —£e)] dv(z) -
dz
dz 2 dz
+ß(S2 —n 2 )v(z).
(6.50)
Since both U (z) and V (z) satisfy a second order differential equation, they can be expressed as linear combinations of two particular solutions, namely, U (z) = c 1 Ui (z) + c u2 U2 (z) and
V (z) = c 1 VI (z) + cV 2 V2 (z) . These particular solutions [Ui (z), Vi (z)] and [U2 (z), V2 (z)] are coupled by Eqs. (6.47) and (6.48), meaning Vl (z) dU2 (z) — dUl (z)
dz
dz
V2 (z) = 0
and Ul (z) dV2 (z) — dVl (z)
dz
dz
U2 (z) =0.
Together they imply C^Z
[Ul(z)v2(z)_U2(z)V1(z)] =0
(6.51)
or
Ul (z)V2 (z) — U2 (z) V, (z) = constant.
6.4.2 Characteristic matrix To simplify the mathematics, let us pick particular solutions that satisfy the following conditions: U1(0) = V2(0) = 1
and
U2(0)=V1(0)=0.
(6.52)
With these conditions, the solutions of Eqs. (6.49) and (6.50) with U(0) = Uo and V(0) = Vo can be written as U(z) = Ul (Z) Uo + U2 (Z)Vo
and V(z) =V1(z)Uo+V2(z)Vo•
We can express these equations in one matrix equation: Ul (z) U2(z)
Uo = NQ(0). vo [v (z) I — [vi(z) (z) V2
Q(z) = U(z)
(6.53)
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
120
Oblique Rays
The determinant of N is 1 (see Exercise 6.20). Inverting the above matrix equation
gives Q( 0 ) =N —' Q(z) _
1 Vi (z) Ui(z) —
)
1
(6.54)
IV (z)1 MQ(z)
The determinant of the matrix M is also 1. Given M, knowledge of the fields at a particular z-plane is sufficient for determining those at other planes. The matrix M is the characteristic matrix of the stratified medium. Let us derive the characteristic matrix of a homogeneous dielectric film, with a TE plane wave propagating in a direction making an angle 0 with the z-axis. In this situation, S = n sin 0 [see Eq. (6.46)], and Eqs. (6.49) and (6.50) become d'-U(z) 2 2 dz2 +kón cos 0U(z) = 0, d'-V(z) a d`2
+ kón cosa0 V (z) = 0.
The solutions are U(z) = cl cos(konzcos0) + c2 sin(konzcos0), V(z) = i
m
cos0[ —cl sin(konzcos0)+c2cos(konzcos0)].
The particular solutions that satisfy the conditions stated in Eq. (6.52) are
Ui(z) = cos(konzcos0), U2(z) = 1 ^m sin(konzcos0), icosøV E e Vi (z) = —i cos 0 £e sin(konz cos 0), V2(z) = cos(konzcos 0), and the characteristic matrix is
M (z)
-1
Ui (z) V(Z) Vi
-
)1
cos(konzcos6)
pose £e sin(konzcos6)
i cos 6m sin(konzcos 0)
cos(konzcos A)
(6.55)
j
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Stratified media
121
Since Maxwell's equations are invariant under the simultaneous substitutions .ii E H
and
Ee
-
/Lm,
the characteristic matrix for a TM wave is cos(konzcos0) MTM(z) _
—icos8 £e sin(konzcos0)
^os`e µm sin(konzcos0) cos(konzcos 0)
with Hy(x,z,t) = U(z)e` (5kox-0)r) and Ex (x z; t) = V
(z)ei(Skox—wt)
The sign change of the off-diagonal elements originate from replacing —icqu m by iWEe in Eq. (6.47). We can therefore define a general characteristic matrix as M(z) —
cos (konz cos0) [ixVsin(kOnzcosO)
, sin(konzcos8)1 cos(konz cos 0) JI '
(6.56)
with NJ = cos 8 £
e
= cos 8
in the TE polarization, and
/1m
(6.57)
yr = —cos 8 ^m = —rl cos 8 EQ
in the TM polarization.
Since the tangential components of the electric and magnetic vectors are continuous across dielectric interfaces (see Exercise 6.9), the characteristic matrix of a homogeneous dielectric film can be extended to describe a stratified medium comprising two materials, the first extending from z = 0 to z = zl and the second from z = zi to z = z2• If M1(z) and M2(z) are the characteristic matrices of the two materials, then and
Q(0) =Mi(zl)Q(zi)
Q(zi) =M2(z2 — zi)Q(z2),
and Q(0) =Mi(zi)M2(z2 — zi)Q(Z2).
This expression can be generalized to a stratified medium consisting of q layers with a total thickness of d, with di denoting the thickness of the ith layer: Q(0) = MI(di)M2( d2)...Me(dq)Q(d) nMi (di ) [
Q(d)
i=
(6.58)
= M(d)Q(d)
= `m 11 m 121 rU(d)^
LM21 m22
lV (d)
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
122
Oblique Rays
Wave transmission through the entire stratified medium can be characterized by the four matrix entries ml1, m12, m21, and m22.
6.4.3 Reflection and transmission Let us now examine the reflection and transmission of an electromagnetic wave upon incidence onto a stratified medium, as illustrated in Fig. 6.7. Consider first the TE polarization, in which the electric field vector is polarized in the y-direction. Following the approach taken in §6.3.2, the electric field in the incident medium is E =E + E refl = aIe i13inc + rle il^refl Y y y
The variables 1j and i3refl embed the spatial and temporal dependence of the incident and reflected waves. It follows that U(0) = a1 + rL. Using Eq. (1.26), we can write V(0) = 1 Vinc ( — a1 + r1),
where, as defined in Eq. (6.57), cos einc Winc =
Tlinc
The transmitted fields are U(d)=t1
and
V(d)=—yr.tL.
Equation (6.58) becomes [Vinc( a_L+r_L)l r — [m21 m22] ^—Want] . —
Solving this equation results in r1 _ Vine (m11 — 1IJtranml2) + (m21 — Wtranm22) (6.59) Nlinc(m11 — 19tranml2) — (m21 — Wtranm22) ' P1 = a1
_
t1
_
21Vinc
(6.60)
T1 a1 zinc (m11 — Wtranml2) — (m21 — Wtranm22)
In the situation where the stratified medium is absent (see Fig. 6.5),
M=[ 0 10 Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Stratified media
123
inc
refI
ninc
aL
Xr,
11
ereil ereil
,
a II
d
n P =n'ß,(1+ivP ) a P 0 r layerp 0 0
bt
0
n q =n'Q (1+
I
Z=O dp
k su sac
I dq J
ginc
ntran
etran
Yr X
tL
1
Z Z
til gtran
Figure 6.7: Reflection from and transmission through a stratified medium.
and —
Pi
Tltran COS e inc — 1linc COS 9tran 11tran COS ejnc + Tlinc COS etran
21jtran COS e inc 11tran COS Ainc + 1 1inc COS Atran
We recover Eq. (6.25). The matrix equation for a TM wave can be similarly derived: (a ll + rll)
cos 6inc(all — rll)
mll m12
1_"^tII
m21 m22
cos Ot tll
Solving this equation gives rij 1Vinc(ml1 — tranm12)+(m21 —ltranm22)
P11=— all
(6.61)
1 Vinc(MI1 — Nftranm12) — (m21 —1 Vtranm22)
tij —2tltran cos 6inc ( ill= — = tinc(mil — Wtranml2) — (m21 — Wtranm22)
(6.62)
all
where yr is defined in Eq. (6.57). Although Eqs. (6.59)—(6.62) have been derived under the assumption that the plane of incidence is the xz-plane, they are applicable for arbitrary planes of incidence, provided the electric field vector is decomposed into perpendicular and parallel components.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
124
Oblique Rays
6.5 Intensity distribution in photoresist`'nl Referring to Fig. 6.7, suppose that the photoresist, with a thickness dp, is the p th layer of a stratified medium consisting of q layers. To determine the resist image, let us define a substack that is a stratified medium comprising the layers (p + 1) to q of the original stratified medium, and an angle 6 p between the z-axis and the +z-propagating plane wave in the photoresist corresponding to an incident angle 6;ßc such that ninc sin eint = nimage Sin eine = np sin Op,
np = , (1 + iKp ) .
The amplitude of the +z-propagating TE wave in the photoresist a1 is related to the incident TE amplitude al by a P ,substack = a 1 I
stack 1 I
where the superscript "substack" refers to quantities related to the substack and the superscript "stack" denotes quantities associated with the entire stratified medium. The electric field within the photoresist layer arising from an incident TE plane wave of amplitude a 1 is stack
ikpfl•r _ ti1 ikP`•r E1— al substack [e + p1 substacke
I
1
stack
= ei(kPx +kP) a ti l ikp1—z [ e+(d1) +
1
substack
1
_ kP,
substack e — ikp(d,—z) p1 ]'
where kP e (kp, kP) and kp fl = ( kP, k, —kf) are the wave vectors of the +z traveling and —z-traveling plane waves in the photoresist. Since the parameters tack , tïbstack, , ,? stack , and kp are functions of B ob] and z only, the ratio of the photoresist electric field amplitude to the incident amplitude is stack
ELI
ti l ikz ( p — z) + p s bstacke— ikP(dp e+' a1 ,substack =[ 1
—z)1 = IF±(Pe, z) I.
(6.63)
1
Equations for the TM polarization can be similarly derived. According to the convention of Fig. 6.7, the x- and y-components of the reflected wave subtract from the incident wave, whereas the z-components are summed: E'Y a
ll
I=I =I
,stack
substack [ e
+ikp(dp—z) — p 5tack e — ikP(dP
—z)1 = IF (pe,z)I
(6.64)
IFI (p e z)I.
(6.65)
I
EZ ,stack
[e+d_z) +p llubstack e — ikP(dd —z)1 =
substack a ll Ts J v iii The
exposition in this section follows Yeung [42] and Flagello [43].
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Intensity distribution in photoresist
125
The effect of the stratified medium is a scaling of the amplitude of each imageforming ray of the spherical wavefront by the factors FL, Fr', and F, depending on the incident angle of the light ray and the z-position within the photoresist. The quantities PP11 and FIIj in Eq. (6.13) should be scaled by the appropriate factors. For example,
P X —^ F1PX j,
and
PYllz
Fj PYllz
The vector field at an image point P arising from a ray is a modification of Eq. (6.13):
Pxi Py F1 Fxy 0 0 01 ' x 'yMX E(P) = 0 0 Fi FxY 0 Px1y PYLY
0
0
Msxx Ms = Mss, Ms
0
0
FI
PxIIY PYIIY PXllz PYIIZ
^Ey J (6.66)
1 Ex = MstackEo,
M&Z Msy^
where Ms.(po;z) = FLPX +F Px l x Msyx(pe;z) = F1Py ix+F PYIIx Ms, (pe; z) = FLPXLy + FXYPxIIY , Msyy (00; z) = FLPy 1y + F PYIIY, (6.67)
Msxz (Po; z) = F^ Pp,
Msy, (Pe; z) = F, PYIIz.
The resist image intensity is
I(,9)=nP f. f iu,g) 7 (ƒ+f^,g+g)H * (f+f^^,g+g^) O(J ,g )Mstack(f +.f' , '
'
+S' ;z)Eo .6* (1" , ")M Ck(f + J", + ";z)E e -^ 2n[(f'-f")x+(g'- ')y] d fdgd f, d g d j" d g
=nP ...ƒju,g)H(f+.P,g+g)H*(f+f^^,g+g^) i ={x,y} j={x,y}
Msik(f + f^ g+g^;z)Ms;k (f + f" g+g";z)EiE^
k={x,y,z}
6 (.jß , S )O * (.Í n
g') e -
^2 ^ [( f_ f)X+(s^- )yl d fdgd J' d g d f"dg'.
(6.68)
Because of reflections from the photoresist interfaces (characterized by F L , FIx, and F^ ), the resist image exhibits a standing wave whose ratio decreases with resist
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
126
Oblique Rays
Figure 6.8: (a) Reflections from material interfaces create a standing wave in the resist image. (b) Post-exposure baking reduces the variation, resulting in a smoothed latent image. depth due to absorption (KP ). ïx An example resulting from the exposure conditions detailed in §D.4.1 is shown in Fig. 6.8(a). Post-exposure baking smooths any standing wave that may be present such that the latent image corresponding to the resist image of Fig. 6.8(a) may resemble that shown in Fig. 6.8(b).
6.6 Immersion imaging A reason for using light rays at large angles with respect to the optic axis is resolution, since both minimum half-pitch and minimum dimension decrease with increasing numerical aperture NA = n; n,age sin 8ob^ . In addition to using a large semi-aperture angle 9 obj, we can increase the numerical aperture by immersing the image space in a high-refractive-index medium [47].x Figure 6.9 shows a schematic representation of an immersion imaging system. This diagram is the same as Fig. 2.7, except that the image space refractive index is nage > 1 and that the focal length of L2 is increased by a factor of nimage• Following the analysis in §2.5 and noting that the angles (po and (pi in Fig. 6.9 are related by Snell's law [see Eq. (6.22)] sin (p0 = ni. ge sin pi, (
i "The
standing wave ratio can be reduced by decreasing reflection from both photoresist interfaces (using top antireflective coating and bottom antireflective coating) as well as increasing resist absorption [46]. The latter approach, however, degrades resist image quality. 'For example, water with a refractive index of 1.45 is a suitable medium at 193 nm.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Imaging with oblique rays
127
Figure 6.9: Representation of an immersion imaging system.
the lateral magnification is
yi
Mlateral = y
_ nimage fM sin (p i
f
sin (po
M'
the angular magnification Mang„lar is
sini5i _ y /nimagefM 1 an a ular — sin150 nimageM' y'/f '
g
and the longitudinal magnification is Mlongitudinal = Mlateral/Mangular = nimageM
2
Compared with an imaging system with the same numerical aperture in which the object and image media are the same [see Eq. (2.12)], light rays in an immersion system travel at shallower angles. The obliquity factor is modified from Eq. (4.11) accordingly: l^g^M2 pé = (6.69) E(0) = 4
1—pé
6.7 Imaging with oblique rays In the scenario shown in Fig. 6.10, where two light rays interfere to form an image, the intensity distribution depends not only on the angle between the rays [see Eq. (5.20)] but also on their polarization. With transverse electric polarization, the vibrational directions of both beams are the same. The total field is the scalar sum of all constituting fields. In the transverse magnetic polarization, however, the field
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
128
Oblique Rays
bi
a1
bll
all
Figure 6.10: The fields add in a scalar manner in the perpendicular polarization but
add as vector quantities in the parallel polarization. vectors do not align. The fields arising from all beams add in a vector manner to give the overall field distribution. Since
images resulting from the parallel polarization have lower contrast than those resulting from the perpendicular polarization. 7 Let us consider, as examples that are shown in Fig. 6.11, the aerial images of three equal line-space chromium-on-glass patterns [see Eq. (5.7)] of different periodicities. The illumination for each pattern is an optimized dipole described by Eq. (5.17): f(j,g) = S(f+ 1 / 2PX)S(S)•
°
With px = 1 [Fig. 6.11(a)], the TE image shows a higher contrast than the TM image. The unpolarized image is the average of the TE and TM images, a result consistent with Eq. (6.5). When the period decreases to px = 1/v [Fig. 6.11(b)], the two interfering orders [see Fig. 5.7(a)] are mutually perpendicular. The parallel field components do not interfere; the TM image has no modulation. But the TE image still has a healthy contrast. With further reduction of the period to px = 0.6, the TM image exhibits intensity reversal [Fig. 6.11(c)]. The nominally bright area becomes dark and the opaque region becomes bright. The unpolarized image contrast is merely 0.277 despite a TE image contrast of 0.906. Although the fraction of p-polarized light increases as a beam travels through a wafer stack (see §6.3.4 and Exercise 6.17), resist-image quality degradation is less severe than the aerial image contrast loss depicted in Fig. 6.11. As a consequence of Snell's law (see §6.3.1), light rays in photoresists, with a typical refractive index of 1.8, travel at shallower angles, thus alleviating TM image degradation. Figure 6.12 demonstrates the difference between aerial and resist images for the scenario of Fig. 6.11(b), the situation where the TM aerial image has no modulation. Contrast of the TM resist image is 0.747 [Fig. 6.12(b)]. Such degradation of transverse magnetic image quality is sometimes said to be caused by polarization-induced stray light.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Imaging with oblique rays
129
TE
TE TM
TM
0
T
cN
unpolarized
unpolarized^ 0 C
-0.50
-0.25
0.25
0.00
position (2/NA)
(a)
i3
=
0.50
-0.50
1
0.00
-0.25
0.25
0.50
position (A/NA)
(b)Px= 1 /\ TE TM
T
c
unpolarized
a, C
-0.50
0.00
-0.25
0.25
0.50
position (7/NA)
(C)
0.6
Figure 6.11: Transverse electric images show higher contrast than transverse magnetic images.
A TM resist image that has zero contrast or that exhibits intensity reversal is possible only if (see Exercise 6.24)
NA>
n^
(6.70)
Assuming a photoresist refractive index of 1.8, this condition means that NA> 1.273, a value that is not uncommon with immersion systems. In increasing the numerical aperture for resolution improvement, immersion imaging is more susceptible (than reducing the wavelength) to vector effects as light rays can travel at more oblique angles within the photoresist. In our analysis of image formation and adoption of the canonical coordinates [Eq. (4.30) in §4.5], we showed that optical imaging scales with XO/NA. But the 1 /NA dependence is first-order, accurate only if the fields add in a scalar manner. With oblique rays, vector addition of interfering light beams gives rise to a second-order phenomenon that increases in significance with vibration direction misalignment (namely, with NA) and that partly diminishes the first-order resolution improvement. The overall resolution improvement is sublinear with 1 /NA.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
130
Oblique Rays
9 TE
TM ..
TE
ó
TM _T
/
N
C N
\
/
/
\
0
-0.50
-0.25
0.00
0.25
0.50
-0.50
-0.25
0.00
0.25
0.50
position (A/NA)
position (AJNA)
(b) resist image
(a) aerial image
Figure 6.12: Resist image quality degradation is less severe than aerial image contrast loss. NA=0.1
R
NA=0.5
y, o
NA=0.9
N C 1
Since c is positive semi-definite, the intensity i(Po) attains its maximum value if the series in Eq. (7.16) has the single term Zj, implying Obal = cjZj. The Zernike polynomials are naturally balanced. Each polynomial combines terms in such a way that the normalized intensity at the geometrical image point is a Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
140
Aberrations
maximum. For example, spherical aberration is balanced by defocus in Z11, and tilt balances coma in Z7 and Z8. Besides the representation of Eq. (7.10), Zernike polynomials can be expressed according to the Fringe Zernike convention
Z^ (0, 0) = Rn (P)Yj ) • The coefficients of an aberration function expanded in terms of the Fringe Zernike set
c(0,0) = Y, cj, ZZ i
are related to those of Eq. (7.13) by c,=clan. Regardless of convention, an aberration function is in general an infinite sum of the Zernike set. In practice, the series is often truncated after J terms such that i
i
4)(p, 0) = 1 c1Z1 _ Y, c^Z^. i =1
(7.17)
1=1
The number of terms is usually 37 in photolithography applications. More terms are necessary in well-corrected (namely, low-aberration-level) systems where higherorder terms have similar magnitudes to the lower 37 terms.
7.4 Effects on imaging Although the exact impact of aberrations is specific to the object shape and illumination configuration, it is possible to describe the general effects of each Zernike term. In addition to the displacement theorem of Eq. (7.6), the concept of ray aberration is useful for our discussion. Consider an arbitrary imaging ray, say QQ'Pr P, in Fig. 7.1. It intersects the image plane at Pr = (x r , yr ), a point that is, in gen (xo, yo). The displacement of Pr from Po is the ray aberra--eral,difntomP= tion [51]: (Ax, Ay) = (xr
—
xo,Yr
—
Yo)
( a XcI aXcI In terms of the normalized variables (A9) = N o (AXI DY) _ NA RX( J: a(D ko R sin 8obi
51 'i •
ac:I: ac = afag
(7.18) )
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Effects on imaging
141
Piston
The first Zernike term Zl (p, 4)) = 1 represents the average phase of the wavefront; it does not affect the image.
Tilt The ray aberration corresponding to a wave aberration represented by = c2Z2 +c3Z3 = c225 cos4)+c32p sin0 is (A ,A9) _ ( 2 c2, 2 C3)The intensity distribution is shifted; but the image is otherwise unaffected. From Eq. (7.6), the amount of displacement is (&,A94) = ( 2 c2, 2 c3, 0 ). Defocus
With defocus aberration ( D(P)0)
= C4Z4 = C4V(2p 2 — 1),
the intensity distribution is unchanged except for a translation by (M,Af,OA) = ( 0,0,4Vc4).
(7.19)
Astigmatism
The effect of the aberration 'D(0 0) = C6Z6 = c« / 1
2 cos 24)
depends on the orientation of the object. A pattern varying only in the x- direction [such as that described by Eq. (5.5)] under coherent illumination with f(f,, g) = S(f , g) means that 0 = 0 or 0 _ Ir, so that (5, 4)) = c6 2 . This is a defocus. A pattern that varies in the 9-direction has a spectrum such that 4) = ir/2 or $ = 3rc/2, and c1(p, 0) = — c6\./p 2 . This is a defocus of the same magnitude in the opposite direction. Features that orient in the . - and 9-directions are focused onto different planes. The other astigmatism term
vp
0
(p, 0) = c5Z5 = c5V op t sin 20
affects patterns oriented at 45 deg and 135 deg with respect to the x-axis. In the presence of Z5 and Z6, different parts of an object experience different amounts of defocusing depending on their orientations. Illustrated in Fig. 7.2, a square opening having an elliptical image is one manifestation of astigmatism. Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Aberrations
142
Figure 7.2: The image of a square pattern [Eq. (5.26)] with d = 0.4, tf g = 1, and tb g = 0 under the influence of astigmatism (c6 = 0.1).
Coma The ray aberration corresponding to Z8 = c8 \(3p 3 — 2p) cos 4
is (Al, A9) = C8
/(9J2 + 3
g2 — 2 , 6.Jß).
For a pattern that varies only in the z-direction with J(f g) = 6(f, g), ( A9) =c8\(9fß -2,0).
While the —c82v term describes a constant translation (x-tilt), the c89^f^ factor represents an image shift that varies with the location of the light ray in the pupil, namely, the amount of displacement is a function of the object spectrum. In addition, the f^ dependence means that a ray at —Jo is shifted by the same amount as that at +fo. The image of a symmetrical object generally becomes asymmetrical. Such lateral asymmetry is orientation-dependent in the presence of Z7 and Z8. Nevertheless, the intensity distribution is symmetrical with respect to the z = 0 plane because of the absence of even power terms in p. Shamrock
Similar to coma, the Zernike polynomials Z 9 and Z 10 result in lateral image asymmetries. The 34 dependence means that patterns possessing a threefold rotational symmetry are the most prone.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Measurement
143
Figure 7.3: In the presence of spherical aberration, light rays from the pupil intersect
the optic axis at different points. Spherical aberration A wavefront with spherical aberration [Z11 — 6p 2 + 1) and higher-order polynomials in which YJ (cß) = 1] can be perceived as having a radius of curvature that varies as a function of the radial position in the pupil. An image is degraded because light rays from the pupil intersect the optic axis at different points, as depicted in Fig. 7.3. Such degradation is asymmetric with respect to the Z = 0 plane, although no lateral asymmetry is introduced. High-order polynomials The effects of high-order terms can be ascertained qualitatively similar to those considered above. Light is diverted over a larger area with increasing n values because the ray aberration increases; the diffraction pattern spreads farther. Objects with mo -fold rotational symmetry are the most susceptible to Zernike polynomials with m = mo. Terms in which m is odd result in lateral but not longitudinal asymmetry.
7.5 Measurement The ability to measure aberration is indispensable for the production of high-quality optical elements. Knowledge of a system's aberration is beneficial to its users as it allows them to ascertain its performance and to monitor possible system degradation. There are three categories of aberration measurement techniques. One determines the aberration from the interference pattern formed by the wavefront of interest and a reference wavefront. Another class of techniques deduces the wave aberration from a set of intensity distributions produced by the imaging system in
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
144
Aberrations
Light source
U
ice
Figure 7.4: Principle of the Michelson interferometer.
question. The third category measures ray aberrations from which wave aberrations are derived.
7.5.1 Interferometry Since we cannot observe phase difference by itself, we need to convert the optical path deviation into an observable quantity. Interferometry determines aberration by interfering the wavefront of interest with a reference wavefront. Phase information can be determined from the resulting fringes. There are many interferometry techniques."' As an example, we study the Michelson interferometer, whose principle is illustrated in Fig. 7.4. A quasi-monochromatic light source is collimated by lens Ll such that a plane wavefront impinges onto the glass plate P with a beamsplitting surface S. The fraction of the incident wave that transmits through S is reflected by mirror M1 and then by S, forming the reference wavefront W. The test wavefront is created by the portion of incident wave that is reflected from S. This light travels through the lens under test (L), being affected by aberrations that may be present, before reflecting from mirror M2 and transmitting through S, resulting in the test wavefront W'. The fringes produced by interference between W and W' can be observed through lens L2.
7.5.2 The extended Nijboer Zernike approach -
The aberration signature of a lens set is usually obtained using interferometry prior to system assembly and shipment. But this signature may not represent the aberra"'Malacara [52] provides a catalog of interferometry techniques and descriptions of their principles.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Measurement
145
tion of the optical system because a final adjustment is usually performed at installation. It is desirable to measure the aberration of the installed optical system and to track its evolution. When in-situ interferometry [53,54] is unavailable, we need to determine aberrations by less direct means. One approach is the extended Nijboer-Zernike method [55]. In this technique, the through-focus coherent transfer function is represented by a linear combination of basis functions expressed in Bessel series. The coefficients of the basis functions are the same as the Fringe Zernike coefficients representing the pupil function. These coefficients are estimated by matching the theoretical and measured intensity distribution through focus. For an aberration function described by
(D(P, 0) =1 c Rn (P) cosm4, the field resulting from a point object is U(P,2) = 2 UÓ (P, 2)+ 2 ^i
m+l cJ n
(P1Z)cosm4,
j
where
n
V (P,2) = e
+i2 (-2i2)u-1 q buvJin+u+2v(2TCP)' u=1
v=0
u(27EP)u
(-1)g • (m+u+2v) • (m+v+u— 1)!(v+u-1)!(p+v)!u! 1 v + v + u)! u-1 v.)!v-}-u— q — )!(q — )!(p buv ( m+v)!( !( 2=kz(1— 1—sine 8 obi),
n+m p= 2 n—m q= 2 and the relationship between j, n, and in are given by Eqs. (7.11) and (7.12). For low levels of aberration, the intensity distribution is approximately I( P,4,2) = 4 IVó (P,2)I 2 + 8 1 c,Re {im +1vó * (P, 2 )VV (P,2)}cosmi j
=x8(P,z) (7.20) j
where +1UÓ (P 2 X (P>z) = f,Re {i ) V,, (P,Z)1, *
n
_ 4 8
ifn=m=0, otherwise.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Aberrations
146
We need to estimate the coefficients c'j given a set of measured intensity distributions at various focus levels 2. Performing a Fourier analysis of the observed intensity distributions Imes (p, 2) with respect to the angular variable gives 21c
Wmea(P, 2 ) =
1
f Imea(p,^^z)cosm0Ci0.
(7.21)
0
Comparing Eq. (7.20) and Eq. (7.21) we obtain, form 0, the relationship Wmea(p, 2) — n
c'j (ß52).
(7.22)
Let us define an inner product
Po (Ymea,
X) = f Wmea(0, 2)X * (P, 2) PdO, z
0
with po being the range of p measurements and the summation covers all measured focus levels. The product of Eq. (7.22) with xn (p, 2) is Cj(in íX n') .
1 (mea,X )=
,
(7.23)
n
If we truncate the infinite series in Eq. (7.22) to N terms, and take the inner product of yrmea with the same N xn (p, 2) terms, Eq. (7.23) results in an N x N linear system
of equations. Its solution gives the least square approximation of the Fringe Zernike coefficients.
7.5.3 The Hartmann test The Hartmann test measures the relative shift of an array of imaged features [56, 57J. The ray aberration and subsequently the wave aberration can be deduced from the displacement data. Figure 7.5 illustrates the principle of the technique. The object contains an array of small openings, two of which are denoted by Ql and Q2 with their Gaussian images at Pl and P2. A screen S with a small aperture is inserted between the object and the first lens, such that the image of Ql (denoted by Pl) is formed by a small light bundle located at (a, b) in the pupil. According to Eq. (7.18), the displacement of Pl from Pl is proportional to the ray aberration at (a, b). Similarly, the shift of the image of Q2 (labeled P2) from its Gaussian image point P2 is proportional to the ray aberration at a different location of the pupil. Measuring the image displacements of all array openings allows us to sample the ray aberration over the pupil.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Measurement
147
plane Figure 7.5: Principle of the Hartmann technique.
To derive the wave aberration from the measured ray aberration, we assume that the aberration can be represented by the first N Zernike polynomials N
c1Z1(0 0).
_
,
i =1
The measured displacement and the Zernike coefficients are then related by N
(Axmea,Aymea) _
az;
N
aZ1 1
C^—, Y, C^ á g ^1 af =i
J
.
For an object array comprising M openings, the Above relation becomes the matrix equation Omea = Ac,
where A is a 2M x N matrix in which each entry is the gradient of a Zernike polynomial, c is the vector of Zernike coefficients, and Omea represents the measured dis -placemntd.Thisrxquaoncbeslvdigthaquremod.
7.5.4 Aberration monitor patterns For optical systems with low aberration levels the phase factor exp(+i2itc) can be approximated by the first two terms of its Taylor series expansion: e +i 2
1c4D(P,O) = 1 + i2t(D(P, 0) = 1 + i2i I c1Zj (0, 0) -
With an aberration monitor pattern [58] described by Ok(x,Y) = Zk(j,g)e 12 +g 2 4
7.4 A dielectric slab of refractive index n and thickness d is inserted between a spherical wavefront and its geometrical image point. Derive the equivalent aberration introduced by the slab. 7.5 What is the intensity perturbation at the center of an aberration monitor pattern if the pinhole transmittance is p = 0.5 and the Zernike coefficient is Ck = 0.01?
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Chapter 8
Numerical Computation The theoretical developments in the last chapters have provided us with equations that describe optical imaging in photolithography. With these equations, images of objects can be computed in lieu of carrying out exposures. The possibility to simulate allows us to harness the power of affordable computers to predict images of object patterns, and to optimize the object and exposure configuration given a desired image. In this chapter we discuss common numerical formulations for imaging simulation.
8.1 Imaging equations Summarizing results of previous chapters, the imaging equation for a system with lateral magnification M is [from Eqs. (6.14) and (6.68)]:
I(x,y,z)=K f ... f J(.f,k)
7
(f+.f',g+g)H * (.f+.f",g+g')
6(.f' ,g' )M(ƒ+f' , +g' )E0 . O* (.Í" ,g")M* (ƒ+.f" ,g+$" )EÓ
e - i 2 "9 d f dgd f'dg d f"dg'
=Kƒ ... f Jj.f,g)H7 (J+j+g )H * (.f +.P',g+g") , Muk(f +f',b'+8')Myk(f +.f„ b'+g")E^E^
i={x,y}
j{x,y} k={x,y,z}
6(f', g') Ô (J", g")e
-i2..p
d.Ídgd.f'dg d.f"dg', (8.1)
151
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
152
Numerical Computation
where
K=
nimage
for coupling image [see Eq. (6.14)],
nP
for resist image [see Eq. (6.68)],
is refractive index of image space, is real part of resist refractive index,
nimage
nP
H
i
(4.31)
is the effective source,
J(f,g) -
ni a
ge 2
e +^2^^(P,^)
if Pe < sin8obi,
(6.69)
otherwise,
0
(7.10)
0(010) = 1 cizi (P, 0), j=1
for coupling image [see Eq. (6.14)], ge [ ^ see E .6. 68], for res i s ta f ii mae l )
_ Mo (f , 9) M(f ' g) M s tack(,Í^g)
(
ro'^' M0yx MO =
Px.,+Pxllx
PYj+PYIIx
oxy
MOYY = Px.Ly+PxIIY PY1Y+PYIIY oz= M0 PXIIZ PYliz
ß2 Px
aß
1_ y2'
__
(6.13)
aß
Px1y 1-72'
Prix=
-
1-12 ,
a2
PyIy = 1—Y2 ,
(6.10)
Py =O,
Px1z = 0 ,
_ a2
Pxllx — 1-721
Y
PYllx = 1aß 2 2
aß7 PxIIY = 1— y2'
PY IY
Pxllz = al
PYllz = ß,
1—y2 '
(6.12)
—
—
MS. Msyx
Mstack =
Ms
Msxz MSY, F±Px +Fjy Pxllx FLPylx+F xyPYllx
(6.66)
= FIPxly+Fl P lly FL PY 1Y+F PYIIY
Fi Pxll z
Fj PYllz
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Transmission cross-coefficient integration
153
F1 (p o z) _ t stack 1 re+ikP(dp—z)+p1bstacke—ikp(dp—z)1 Z substack L
(6.63)
1
,stack
11 (Po,z)
^Ikubstack [e
+ikP(dp _z) — P llubstack e —ikf (dp—z)1
(6.64)
,stack F^ ( Po ,z ) = tis^ubstack [e+' Pi
z (dp —z) +
1,
Pbstacke—ikP(dP—z)]
(6.65)
(6.59)—(6.62)
pij,zij,
dp is the photoresist thickness, = f sin9 obj,
ß=gsin8obj, y= 1—(f2 +g2 )sin e 8abj= pe = psinO obj,
= f2 + g2 , (P= (j'- 1")x+(S — g^)ƒ+(kp r— kp
„ )(z —
zo).
For computation of the coupling image, kp is the z-component of the wave vector in the coupling medium, and zo is zero. If our interest is in the resist image, kP is the z-component of the wave vector in the photoresist and zo is geometrical focal plane relative to the top of the photoresist. In writing Eq. (8.1), we assumed that • the spectrum is independent of the source point except for a translation proportional to the location of the effective source point [Eq. (4.40)]; • the diffracted field arising from each source point is proportional to the object spectrum at the appropriate spatial frequencies (f, g); • polarization of the diffracted field is independent of the object and the effective source; and • the ratio of the perpendicular to the parallel components is preserved during diffraction and propagation through the imaging system.
8.2 Transmission cross-coefficient integration To calculate image intensity at any point, we need to evaluate the sixfold integral of Eq (8.1), a daunting computation task. We can reduce the effort by assuming that the object is periodic in both the z- and 9-directions with periods p x and py . The spectrum then comprises discrete diffraction orders (see Exercise 8.1):
ó(M)=Er 5 (f — m ,g — n )O mn. n m
Px
Py
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
(8.2)
Numerical Computation
154
An isolated pattern is approximated by a periodic object with a large period. Such an approximation is acceptable, except for coherent illumination, because the complex degree of coherence is vanishingly small if the separation between points is large. The image of an isolated pattern under coherent illumination can be computed using Eq. (4.39). With a discrete spectrum, Eq. (8.1) becomes an equation with a double integral: — i2ncp Om n O* m„ n„ e
Î(,9,z) = K
=
n!"n',m'
fff(.f,g)H(.f+m'/ß ,g+n/ßy)JJ*(. +fn"/p,g+n"/ß ) Mik (f', g )Mk (J", g")EiE dfdg (8.3) t={X1'}
k= {.r _ .Z}
Om'n, O n,^ nn e^i2' r TCCm of ;mrrn ri. z EiEY ,
K
j={.rp}
rt 1 .m^
k-{.r,y,z}
where +00
TCC W ;m„ n,,.Z =
LI
J(ƒ,g)H(J+m'/ß, + n'/ß)
H* (f + m"/ß, + n"/Py)Mik (5; z)MYk (Pë; z) d f dg. (8.4) Illumination in optical lithography is usually unpolarized. Since unpolarized light of intensity IQ is equivalent to two independent linearly polarized waves of amplitude IQ/2 vibrating in mutually perpendicular directions (see §6.1), I (x,Y, z) — 2 K
Om'n' Omnnn e —i2 '
TCCn„.mnnn. Z.
(8.5)
k={r.y.z}
To determine the intensity at a particular focus level z', we need to calculate the various transmission cross-coefficients TCC n ;m and sum up contributions from all relevant diffraction orders according to Eq. (8.3) or Eq. (8.5) [42, 61]. A possible computation procedure is: IIn°Z'
1. Calculate the object spectrum Ómn by Fourier transformation of Ô(,9) [see Eq. (4.33)]. 2. Compute the set of transmission cross-coefficients TCCm n,;m°n,,;Z by numerical quadrature. For each coefficient, the region of integration is defined by the effective source J(f,g) and the two displaced pupils H(f+m'/px ,g+n'/py ) and H(f +m"/px ,g+n"/p) ) (see Fig. 4.8). Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Source points integration 155
3. Sum contributions from different polarization couplings [indexes i, j, and k in Eq. (8.3) or indexes i and k in Eq. (8.5)] and diffraction orders [indexes (m', n') and (m", n") in the equations]. Computation complexity of this transmission cross-coefficient integration method (also called Hopkins' approach [19]) depends on the number of diffraction orders. The range of m' and m" is proportional to px and the range of n' and n" to p y , meaning that the computation time tcompute scales asymptotically with the square of the object area A m = px x py : 2
tcompute °C (C1tTCC+C2tsum)Am,
where tTCC is the computation time of one transmission cross-coefficient and tsum is proportional to the time needed to perform the multiple summations in Eq. (8.3) or Eq. (8.5). Since each transmission cross-coefficient calculation involves a 2D quadrature that requires evaluation of the computation intensive functions Mik(OO;Z)Mjk(Pë;z) and H(f,g)
while ts„m involves only a few additions and multiplications, tTCC » is im
and
tcompute « tTCCAm
For calculating images produced by a specific imaging system with objects having the same periods pX and py , the transmission cross-coefficients can be precomputed and stored, thereby shortening the computation time such that tcompute °` tsumA 1
8.3 Source points integration Instead of performing numerical quadrature to compute transmission cross-coefficients, the source points integration approach (also called Abbe's method) transforms the sixfold integral of Eq. (8.1) into a fourfold integral, which can subsequently be approximated by a summation. Such transformation is possible because source points comprising the light source are mutually incoherent. Intensities produced by all source points aggregate to give the final image. This addition property allows us to express Eq. (8.1) in the following form:
Î(î,z) — ^^j(.Í,g) ^ ^H(i+f ,g+g^)^(Í^ g ) J '
2
(
M.Eo)e-^2n[f'x+g'y+kA^(z -zo)] dpdg dfdg. (8.6)
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
156
Numerical Computation
The quantity delimited by the absolute value is the electric field, generalized from Eq. (4.36) for polarization effects, arising from a source point of unit strength located at (I, g). If we denote the square of this quantity by Ícon , we can rewrite Eq. (8.6) as +00
I(,9,z) = I. (f,g)Icon(f,g;z)dfdS,
where Icon(f,;z) = K ^^H(f +f^,g+g)O(f^^g )
(
M Eo e —i2rz [!'z+g 9+Oz ' (z—zo)] d .
)
f'dg 2
We can perceive the overall image as the sum of an infinite number of weighed coherent images Ícon, with each component image arising from an effective point source of intensity f(fs , gs ) located at (fs, gs). For a periodic object with a discrete spectrum described by Eq. (8.2), +00
LI
i i
i
i
H(f+f ,g+8)o(f ,g)(M'Eo)e
f'x+'y+
2
'(z—zo)]
df dg
• E e—i2 [mx/PX +ny/ny +kP (z —zo)]
y) O ^ mn ( _ I H(.f + m/ß, g+ n/ ß)
0)
m,n
and Icon =
KI l (f +m/px,b' +nl b )O y
mn (M
Eo)
m,n
e —i2n [in /Px+n9/Py+kP' (z—zo)
2 ]
. (8.7)
In numerical computation, we approximate the effective source function by a finite number of point sources:
j(f,9)=Jf(J,g)
(8.8) s
where a s is the effective strength of the discretized point source located at 6(f — fs, g — gs). The image becomes I(x,f,z) =1 aslcon(Ís,b's;z),
(8.9)
s
and a computation procedure for this source points integration method can be:
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
157
Coherent decomposition
1. Calculate the object spectrum Ómn from Ô(1,9) [see Eq. (4.33)]. 2. Approximate the effective source J(f, g) by the discretized source JA(f, g) [see Eq. (8.8)]. 3. For each discrete source point, compute the component image according to Eq. (8.7). 4. Sum component images to obtain total intensity [see Eq. (8.9)]. For an effective source of area A and an object of area A m , the computation time scales according to S
tcompute — As . Am
8.4 Coherent decomposition We can also reduce the sixfold integral of Eq. (8.1) into a fourfold integral by (inverse) Fourier transformation of the constituting functions into their spatial-domain counterparts. For scalar imaging we essentially recover Eq. (4.32) from Eq. (4.35) (see also Exercise 4.4):
I(x,Y) =
Nl
w (x — zá,Y — yo,x — xö,Y — Yo) o(xo,yo)O' ( 1o,9o)d1od9d1ád9ö,
where
W (.óY10;x ool ,Yo) = f((xo xo^,Ya yo)H(xo,Yo)H * (xo,Yu)• —
—
The function W can be decomposed into a series of its eigenvectors cpk [62-64]: W (xo}Yo;x11 40) _ ^k(Pkl-oJo1 ) (Pk(xoJo} k=1
where Xk is the eigenvalue corresponding to the eigenvector cpk. With this series representation, the image becomes
I(19)—k^kkffff (Pk(x—z Y—Y)Ô(xY) (P*(x—.x',9-9")O*(xß,3 ")d2'd9' dx"dy"
_ k=1
ff (Pk(.x-x,9-f)O(x,Y)d 'dy
+°°
1
,
2
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Numerical Computation
158
We can interpret each eigenvector as the transfer function of a coherent imaging system [compare with Eq. (4.1)]. The overall image is the weighed sum of the images produced by an infinite number of coherent systems. In situations where the magnitudes of a few eigenvalues are much larger than the rest, we can approximate the image as a finite sum of these dominant eigenvectors: K
1(1,9) =
2
(8.10)
kk (O ® Pk) (x, Y) I , (
k=1
where K is the number of significant eigenvectors, and the symbol ® denotes convolution. To calculate the image intensity at any point, we need to compute K 2D convolutions. Equation (8.10) is amenable to efficient computation of objects comprising polygons. For example, a rectangular pattern of foreground transmittance tf g and background transmittance tb g = 0 with vertices at (ui , 9i), (12, 91), (12, y2), and (11,92) can be described by O(x,Y) =tfg[Q( 1— zl,9 - 9l) — Q( 1-12,9 - 91)+
Q(1-12,9-92) — Q(x — x1) — y2)], where 1 Q(x ' Y) 0
ifx>0and9>0, otherwise,
is the quadrant function. An object consisting of N rectangles can be described by
N
o(x,Y) _ tfg ) [Q(x -1 1n) ,y — Yins ) — Q(.x — x2 'Y — Yins )+ n=1
Q(x—x2n^'Y—Y2n)) — Q(x — x1n ^,Y — Y2
where the superscript (n) indexes the rectangular shapes. Substituting this object description into Eq. (8.10) results in I(x,Y) = I Xk ( 6 ®^k)(xiY) 2 k=1 K IN
(n) (n) (n) ^ (n) 1vk(x—x1 ,9 91 ) — lk(X -12 'YY1 )+
_ ^?k k=1
-
I n=1 2 (n) (n) (n) (n) 1 1Vk(x —.z2 ,Y — Y2 ) — Mfk(x — x 1 ,y—y 2 ) J , (8.11)
where 11k(1,9) =
Q(x,y) ®(Pk(1,Y)
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Object spectrum
159
is the convolution of the quadrant function with the kt eigenvector. As the functions 1rk(1,9) are independent of the object, they can be precomputed (for each particular exposure configuration). Image calculation becomes a double summation over the significant eigenvectors [index k in Eq. (8.11)] and over the rectangular shapes [index n in Eq. (8.11)]. Each inner loop requires four function look-ups and three additions, and each outer loop needs two multiplications. The computation cost is tcompute = NK(4tlook-up + 3 tadd) + 2Ktmuit + (K —1)tadd [+Kty^], where tlook-ups tadd, tm„lt, and t, are respectively the time needed for function lookup, addition, multiplication, and computation of a convoluted function 111k (1,9). Assuming the functions yrk(1,9) are calculated in advance, the computation time scales linearly with the number of object vertices asymptotically. The procedure for image computation by the coherent decomposition approach is:
1. Determine the eigensystem of W, and the number of retained eigenvectors K. 2. Precompute the functions yrk (x, y) for all significant eigenvectors. 3. Compute the image according to Eq. (8.11).
8.5 Object spectrum The object spectrum is the Fourier transform of the field transmitted by the object under the illumination of a unit-amplitude wave. This field behaves in a complicated manner, varying continuously and oscillating in the vicinity of transitions between regions of different transmittance. To simplify image computation, we make the approximation, similar to Kirchhoff's in his derivation of the diffraction integral [Eq. (3.7)], that the transmitted field changes abruptly according to the transmittance of the object. This thin-object approximation (also called thin-mask approximation), of which Eqs. (5.5) and (5.26) are examples, suffices for patterns sizes large compared with the wavelength. When the object dimensions are on the order of or smaller than the wavelength, we should treat object transmission as a boundary value problem based on Maxwell's equations, and solve for the transmitted fields numerically. For example, for the space depicted in Fig. 8.1, we need to find the field across the line AB subject to an incident radiation. The computation should consider material properties, including the refractive indexes of the mask substrate nglass, the opaque layer n opa , and the object space medium nobject, as well as the thickness d opa of the light-blocking material.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Numerical Computation
160
Figure 8.1: The field along AB can be determined by solving Maxwell's equations. The thin-mask approximation simplifies object spectrum computation.
Figure 8.2: Time-domain finite-difference is one approach for computing the field along AB in Fig. 8.1.
Time-domain finite-difference is one approach for solving such a numerical problem [65-68].' Using the integral form of Maxwell's equations [Eq. (1.6)], we solve for the six electromagnetic field components
e = ( ex
,
ey , ez )
and
h = (hX hy , hZ ,
)
by decomposing the object into a grid and assigning one field component to each grid point, as illustrated in Fig. 8.2. For a cubic grid in which Ax = Ay = Az, the surface and line integrals in Eq. (1.6) are evaluated on squares. Referring to Fig. 8.3, the z-component of the electric field ez(iAx,joy,kAz) = ez(i,j,k) =e(,,)
can be expressed in terms of the x- and y-components of the magnetic field:
á
(£e t +óc)ez(i,j,k)
= [h,(i,j— 1/2,k) —h,(i, j+ 1/2,k)+ by (i+1/2,j,k)—by (i-1/2,j,k)]Ax. (8.12)
'Besides the time-domain finite-difference approach, the boundary value problem can be solved by finite-element [69] and frequency-domain methods [70-75].
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Object spectrum
161
hX (i,j+1 /2,k) Ay/2 b y (i -1 /2,j,k)—e, k)—h y (i +1 /2,j,k) Y hJJ-1/2,k)
ZLX
Ax/2
Figure 8.3: The z-component of the electric field is expressed in terms of the x- and y-components of the magnetic field.
This equation is derived under the assumption that ez (i, j, k) is constant over the square surface and that the magnetic field components are constant along each line segment comprising the square. To obtain a time discretization of Eq. (8.12), we assume that the electric field components are constant within the time period [nAt, (n + 1)At), and that the magnetic field components are fixed within the duration [(n — 1/2)At, (n + 1/2)At). This formulation results in the time-domain finite-difference equation: eZ
+1(> >) =
aez (, ,) + ß [hx +1/2 (, —1 j2, ) — hz + 1/2
where
_
a
(,1/2,) +hy
+i/2 ( 1 / 2
,,) — by
+i /2 (
-1 / 2
,,)11 (8.13)
2Ee — G At _ At 2 and 2£e + 6cAt ß dx 2Ee + 6,Ot
Following the same procedure, the equations for the other five components are ey +1 ( 1/2,1/2)= aey(,1 /2,1/2) +ß[hz +i /2 (-1/2,1/2,1/2)— h 1/2 (1/2 1/2,1/2) +h 112 ( 1/2,1) —hX+z /2(,1/2,)], (8.14)
e
x +l (1 /2„ 1 /2) = aex (1 /2„ 1 /2) + [hy+i
/2 (1 / 2
„) —
by +1/2 ( 1/2 1) +hz +i /2 (1/2,1/2,1/2) —hz +1/2 (1/2,-1/2,1/2)1, (8.15) hz +1/2(1/2 1/2,1/2) = hz -1 / 2 (1/2 1/2,1/2)—
At1 [e,'(1/2„ 1/2) —eX(1/2,1, l/2)+ey(1,1/2,1/2) —ey(,1 /2,1/2)], (8.16) Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Numerical Computation
162
n-1/2
n+ l2
by
(1/2 )=hy l „ At 1
hx+l/2 (,
(1/2,,)—
[e?(„)—ez(1„)+ey(1/2„1/2)—ey(1/2„-1/2),, (8.17)
1 / 2 ,) = hX—' 12 (^ 1/2,)— At
I ez(,1,)—ez(„),. (8.18)
The field transmitted by the object can be computed by iteration of the above equations. To confine the computation volume, absorbing boundary conditions are needed on the border surfaces of the simulation domain. These conditions result in a different set of iteration equations compared with Eqs. (8.13)—(8.18). Please refer to the literature [76] [77] for details. Electromagnetic simulation using the time-domain finite-difference approach thus comprises two primary steps: 1. Discretize simulation volume; assign (material) parameters to each grid point. 2. Iterate field components according to Eqs. (8.13)—(8.18) or boundary conditions until convergence.
8.6 Remarks The formulation of Eq. (8.1) assumes that the wafer stack is a stratified medium. In situations where there is significant wafer topography effects, the analysis of §6.4 is inadequate. There is generally no close-form solution. The resist image must be computed rigorously by consideration of light coupling into the photoresist in the presence of wafer dielectrics [78, 79]. The time-domain finite-difference technique described in §8.5 is one method for such calculation. Besides images in photoresists, we are often interested in modeling photoresist dissolution. The effects of resist processing can be simulated by two classes of techniques. Rigorous approaches [80-84] attempt to describe the photoresist dissolution process with first-principle formulation involving reaction and kinetics of chemical species within the resist. Phenomenological methods [85-88], on the other hand, seek to model photoresist behaviors heuristically by calibrating actual resist performance with model parameters that may bear little relationship with resist chemistry and physics. Interested readers may wish to consult these references for details.
Exercises 8.1 Show that the spectrum of a periodic object with periods px and py in the i- and 9-directions is given by Eq. (8.2).
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Exercises
163
8.2 Derive the spectrum of a periodic rectangular feature with periods pX and py in the x- and 9-directions. The coordinates of the lower left and upper right corners of one of the patterns are (io,9o) and (xl,yl). Each rectangle has a transmittance of tfg with the background transmittance being tbg . 8.3 Is it possible to simulate immersion imaging with a simulator that assumes the refractive index of the coupling medium to be one? If so, how? 8.4 Using Stoke's theorem (see Exercise 1.5), develop a formula to compute the area of a polygon.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Chapter 9
Variabilities In replicating an integrated circuit layout during fabrication, the same object shapes are often delineated numerous times. Because of unavoidable variabilities of a manufacturing process, the delineated shapes are generally different from the nominal shapes and from one another. This variability should be kept under the specified tolerance according to which integrated circuits are designed. Too much deviation causes circuit failure. An understanding of the various causes of variation is helpful in devising means to reduce, stabilize, and compensate for the undesirable variation. Although all processing steps (such as deposition, lithography, etching, and chemical-mechanical polishing) contribute to patterning nonuniformity, we focus on variabilities arising from optical imaging, since, with both layout shapes and image tolerance shrinking rapidly compared with Xo/NA, control of image variabilities is of increasing concern. Lithography becomes more difficult with decreasing kl and k1h_P;tch.
9.1 Categorization We can classify the causes of lithography variability into two categories. One affects object shapes located in identical environments, and the other impacts the same object shapes situated within distinct configurations of neighboring shapes. Let us call the effects of the former fluctuations and those of the latter inherent variations. The same cause can result in both fluctuation and inherent variation. On the other hand, a fluctuation and an inherent variation may have similar manifestations. To distinguish fluctuation from inherent variation we must first expound the meaning of environment. Let us define afeature as an object shape within a particular configuration of neighboring shapes. The shapes A and B in Fig. 9.1, although identical, are distinct features because their environments of shapes are different. The periodic space described by Eq. (5.5) is a feature that is fully specified by its foreground and background transmittance tf g and tb g , size d, and period ßx. A peri-
165
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Variabilities
166
Al
A2
A
gi
B B2 B2
Á
A
B^
l A2
Bj
Figure 9.1: A feature is an object shape within a particular configuration of neighboring shapes. A and A' are the same feature if we identify edge Al with Ai and A2 with A. odic line feature is similarly determined by these four parameters:
OX(i) = tbg if Ix—npx l < cl/2, n E Z, g otherwise.
ti
(9.1)
The orientation of a feature does not constitute its environment. For example, a y varying periodic line described by -
^
tb g if IY — nPy I < d/2, n E Z, y ^ y ^ — tfg otherwise,
is the same feature as that described by Eq. (9.1), provided they have the same tb g , tfg , and cl, and px = py . With this understanding of "feature," we can define fluctuation as the variability of the same feature, and inherent variation as the difference, excluding fluctuation, between distinct features .of the same object shape. Referring to Fig. 9.1, the difference between the delineated shapes of A and A' (and that between those of B and B') is fluctuation. So are differentiations between their delineated shapes across an exposure field, from wafer to wafer, and from lot to lot. The distinction between the averaged image of all replicas of A and A' (and those of B and B') is inherent variation. By this demarcation, we can perceive fluctuations as arising from engineering imperfection while inherent variations as consequences of the laws of physics. The variation in the images of periodic spaces with the same nominal dimension d but different periodicities follows from Eq. (4.35) because of differences in their spectra. But no physical law dictates that images of a feature at two points in the field of an optical instrument should differ. The difference one may observe can be caused by aberration fluctuations of the imaging system.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Proximity effect
167
A
pthreshold
w
O
-1.0
-0.5
0.0
0.5
1.0
x (7JNA)
Figure 9.2: Method for determining the size and placement of a space image.
To quantify image variability, it is useful to convert each image into a number or a set of numbers that represents a relevant quality of the image. Metrics for this purpose include contrast, depth of focus, exposure latitude, normalized image logslope [89], exposure-defocus window [38], and total window [90]. These metrics can be determined from measured or simulated data. As an example, Fig. 9.2 illustrates a method that derives image size and placement from a computed image. The image of a feature, a periodic space in this case, is first computed using the techniques described in Chapter 8. The size iv of the image is the distance between the intersections of the simulated image and a threshold intensity 'threshold.' The placement error 0 can be defined as the distance between the center of the object shape and the midpoint of the two intersections. We use this method in our investigation of variabilities in subsequent sections of this chapter.
9.2 Proximity effect The spectrum of an object spreads as its size decreases. For dimensions on the order of or smaller than the wavelength, the pupil cuts off spatial frequencies that carry a sizable fraction of the light energy transmitted through the object. The image becomes distorted. One manifestation of optical distortion is proximity effect, an inherent variation where the image size w of a shape changes depending on its environment. A typical scenario is shown in Fig. 9.3, which plots the image sizes of nominally d = 0.6 spaces [Eq. (5.5)] and lines [Eq. (9.1)] as a function of period. As the period changes 'We can relate !threshold to the exposure dose Dprint that is supplied in an exposure. Denoting the dose to clear a positive resist or to harden a negative resist with a completely transmitting mask by Do, the dose D pri ❑t is Dprint
Do 'threshold
This approximation becomes more accurate with increasing photoresist contrast [85].
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
168
Variabilities
❑--❑
space
Q o Z
0-0
.
line
N
N_ (0
O N ^ N
.1
O
1.0
2.0
3.0
4.0
period (RINA)
Figure 9.3: Proximity effect is the change in image size depending on the environ-
ment of an object shape. The features are nominally d = 0.6 spaces and lines with different periods.
the image sizes of periodic lines vary by as much as 20%. ü Figure 9.3 suggests that periodic spaces exhibit less proximity effect than periodic lines. We can use Eq. (4.32) to investigate whether this is generally true. Let us first contrast the imaging of two periodic spaces of the same size dbut different periods: pi = 2d and P2 = 4d. Imagine that the object is divided into atomic regions of width d, as illustrated in Fig. 9.4(a). The integral in Eq. (4.32) can be converted into a sum involving interaction of these atomic regions. For tfg = 1 and tb g = 0, the intensity at space Ao is roughly +00
I2d 0e ^ 1 J(0, 0)H(2nd, 0)uY* (2nó, 0) } n=--
I, l (2 [i — j]d, o) H(2id, 0)R* (2 jd, 0) (9.2) iii
and +00
,space f(0, 0)H(4nd, 0)H* (4nd, 0)+ n=--
J(4[i— j]d,0)H(4id,0)PI*(4jó,0). (9.3) i0i
Except for coherent illumination, the mutual intensity J(x,9) generally decreases with increasing 1.xJ and y^. For circular illumination with a partial coherence factor 6 [Eq. (5.13)], the first dark ring of f(î,9) occurs at [see Eq. (4.29)] r
_
z2 + y2
_ 0.61 6
"Proximity effect generally means any type of image shape distortion including pattern shortening and corner rounding. But it has the specific meaning of image size variation with period in this context.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Proximity effect
169
2d 0
0
— — 1ine — —
0
0
0
O 0 0 0 0 0
Ap
A2
A4
A -1 A l
pl =2c1
pl = 2d
4d O
A3 A5
' line \ —
O
0
Ao
A4
00 000
OOO 00
A - ,A - ,A -1 A l A 2 A3
A5 A6
pl =4J
(a) space Figure 9.4:
(b) line
Spaces generally exhibit less proximity effect than lines.
If we assume 6= 0.8 and d=0.4, f(î,9)0
for r > 0.8 = 2d.
(9.4)
With the approximation of Eq. (9.4), the intensities expressed by Eqs. (9.2) and (9.3) become I2ppace.
+00 N
j(0, 0)H(2nd, 0)H* (2nd, 0),
and
n=--
Isppace
+00
N
f(0,0)hH(4nd0)H*(4nó,0). n=--
Since the point spread function H(z, y) generally decreases with increasing r, with the first zero at P = 0.61 for a circular aperture [Eq. (3.24)], only the first term is significant in both sums. The intensities of the two spaces are approximately equal. The situation is different for periodic lines. Dividing the object into atomic units in a similar manner [Fig. 9.4(b)], the image of a line of size d and period pl = 2d is roughly
72ó
e
—
If(o, 0)R( — [n2 + 1], 0)R ( — { n2 + l], ) +
n=--
ll,
([i— j]2d,0)H(— [i2+1]d,0)H*(—[j2+1]d,0).
i0i
The sum over i and j can again be neglected due to Eq. (9.4). But the double sum is non-negligible for a line of period p2 = 4d. For this feature, the atomic units such as A 1 and A2 are separated only by d. They interact and contribute to an image that can be much different from the image of a line of period pi = 2d. Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Variabilities
170
In addition to mask tone, proximity effect also depends on illumination configuration, mask technology, and wafer stack through their presence in Eq. (8.1). Although the exact behavior depends on the detailed interaction between the object spectrum and optical system [Eq. (5.1)], proximity effect generally increases with decreasing kl and k1h_p;«h factors, and with an increasing degree of illumination coherence. For example, the first zero of the complex degree of coherence under partially coherent illumination [Eq. (5.13)] is, according to Eq. (4.29), r=0.61 ko aNA The range of non-negligible optical interaction increases with decreasing partial coherence factor.
9.3 Object variabilities (photomask errors) 9.3.1 Dimensional error Photomask dimensional error is the deviation of a mask-feature size from the designed size. From a mask maker's viewpoint, dimensional error comprises both fluctuations and inherent variations. For the purpose of our discussion, all mask dimensional-error components are considered fluctuations. When an object pattern is large, its image is scaled exactly, except for a constant bias bo, by the magnification of the projection optics M. The image size wo varies linearly with the nominal mask dimension do with a slope equal to the magnification M: wo = Mdo + bo. Deviation of mask dimension from the nominal (Ad) results in a scaled linewidth error (Ow): Aw=w —wo = M(d — do) + (bo — bo) = MAd.
A unit change in the mask dimension corresponds to a unit change in the image size scaled by the magnification of the exposure system, a phenomenon illustrated in that portion of Fig. 9.5(a) where mask dimensions are large. The effect of mask dimensional errors is diminished in a reduction system. Image sensitivity to mask variation generally increases with decreasing feature size. When the object size is small, the effective mask dimensional error MAd is magnified [mask dimensions smaller than 0.5Xo/(NA x M) in Fig. 9.5(a)]. The severity of this amplification is described by the mask -error factor (MEF) [also
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Object variabilities (photomask errors)
171
U,
U,
0—o space
Z o
ó ^
N
w
0—❑
Ü
N N
square
Q
a)
N
^
E ^? 0
N
co
(C
E
1.0
0.5
1.5
0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
d (GINA)
mask dimension [X/(NA x M)j
(b)
(a)
Figure 9.5: (a) Image size varies linearly with mask dimension for large features, but sensitivity to mask variation increases with decreasing features size. The features are periodic spaces with ßx = 2d [Eq. (5.7)]; and the magnification factor is assumed to be 0.25. (b) The mask-error factor typically increases with decreasing feature size, although the exact behavior depends on all aspects of the imaging process, including illumination configuration and mask technology. [From A. Wong, et al., "The mask error factor in optical lithography," IEEE Transactions on Semiconductor Manufacturing, vol. 13, no. 1, pp. 76-87 (Feb. 2000), copyright (2004) IEEE.]
called mask-error enhancement factor (MEEF)] [91, 92], such that the relation between mask dimensional error and image size fluctuation becomes Aw = MEF x MAd. As an example of mask-error factor dependence on object size cl, Fig. 9.5(b) plots MEF for the same spaces as those in Fig. 9.5(a) as well as for a square [Eq. (5.26)]. The MEF is I for large features, but increases from unity as d decreases below 0.75 for the square and 0.5 for the space. Because of its amplification for small object sizes, mask dimensional error is a significant fluctuation contributor in low-k1 optical lithography. i^i "'In addition to its dependence on object size, the mask error factor also depends on feature type and its environment, photomask technology (chromium-on-glass, attenuated phase-shifting mask, and alternating phase-shifting mask), and illumination configuration (on-axis and off-axis). In general, the factor increases with decreasing kl and k1half-P,«h. One exception is the imaging of a narrow, sparse line by alternating phase-shifting masks in which the MEF approaches zero with reduction of the chromium size [931.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
172
Variabilities
0
d=0.10 ?/NA
CR ó
---------------d=0.051JNA
>'(0 to o
N •_
t Ó
N O
-2
-1
0
1
2
x (AJNA)
Figure 9.6: The image intensity is quartered when the size of a small isolated space is halved [23].
Mask-error amplification arises from image integrity degradation. Consider a 1D isolated space of width d: _ 1
^x(x) 0
if 1 xß < J/2,
otherwise.
Its spectrum is ô(f) =dsinc(fd). In the limit of small d, lim Óx (f) = d, d-*O
and the image is [see Eq. (4.37)] I () d
ff
TCCX(t ; ") e -i21r [íf'-f")z1 d f' d J"
—
(9.5)
= d^L(J, H). The image shape is determined by L(J, H), a quantity sometimes called the line spread function. Since L(J, H) is a function of the effective source intensity dis J and the coherent transfer function H, the image shape depends only on-tribuon exposure system parameters. Modification of the object has no effect on the image shape, except for scaling of the intensity by the square of its size. Halving the size quarters the intensity. This phenomenon is illustrated in Fig. 9.6 for two spaces of sizes d= 0.1 and 0.05. The quadratic dependence of image intensity on size described by Eq. (9.5) is the primary physical cause of the mask error factor. We can qualitatively ascertain
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Object variabilities (photomask errors)
173
O
N Y c0 Q
Eq. (9.5) W) Eq. (4.38)
—
Ó .^ 6 0 O 0 d r
a) .^ 0 D)
O
0.0
0.2
0.4
0.6
0.8
1.0
d (XJNA)
Figure 9.7: Severity of mask error amplification can be ascertained by comparing the peak image intensity predicted by Eq. (4.37) and Eq. (9.5). [From A. Wong, et al., "The mask error factor in optical lithography," IEEE Transactions on Semiconductor Manufacturing, vol. 13, no. 1, pp. 76-87 (Feb. 2000), copyright (2004) IEEE.]
the severity of error amplification by comparing the square root of the actual image peak intensity with that of Eq. (9.5). A large difference means that mask errors are not amplified much, whereas closeness in the numbers indicate large effective magnification. Plotted in Fig. 9.7, the difference between the straight line, which represents Eq. (9.5), and the curve, which represents the actual image peak intensity, is big for large spaces. As the size decreases from d = 0.5, the curve and the line begins to converge. Below d = 0.2, changing the object no longer changes the image shape. Image control is lost. A similar consideration applies to squares with minor changes of the imaging equation (see Exercise 5.10):
I^x,9) _ [
yJ (r)
r
J
Z d4 •
In the limit of small cd, the image shape is solely determined by the point spread function while the intensity varies with the size. Rather than a quadratic dependence, the intensity is proportional to the fourth power of the linear dimension of the square contact hole. Because of this fourth power dependence, the MEF of contact holes starts to increase from unity at a larger dimension and has a higher value than that of spaces at the same dimension. The MEF can be as high as 4 for 0.5X0/NA contact holes, offsetting the advantage of 4X reduction systems. Although loss in coupling image integrity is the primary cause of the MEF, all factors causing image quality degradation, whether coupling, resist, or latent, worsens the MEF. Focus error, aberrations, excessive diffusion of chemical species in photoresists, and low photoresist contrast all increase the MEF.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
174
Variabilities
incident radiation
Figure 9.8: An alternating phase-shifting mask fabricated using a subtractive process.
9.3.2 Phase and transmission errors Phase and transmission errors are results of nonuniformities of phase-shifting mask dielectrics. They can arise from mask topography, imperfection of defect repair, and thickness nonuniformity of photomask films. As an example of phase and transmission errors, consider an alternating phaseshifting mask fabricated on a chromium-on-glass substrate using a subtractive process. Illustrated in Fig. 9.8, the 180 deg regions are etched areas of the fused silica substrate. The thickness of material removed detch is determined by requiring an optical path difference of
2n+1 ^
aEZ 2 (9.6)
between light passing through the 0deg and 180 deg phase regions. For a path difference of 2/2, a first-order estimation of the required etch depth detch is
xo
nglassdetch — nobjectdetch =
i
2nobject
detch =
(9.7)
2nobject(nglass — nobject)
where nglass is the refractive index of the fused silica substrate at the exposure wavelength and nobject is the refractive index of the object space. For wavelengths of interest in optical lithography (436, 365, 248, 193, and 157 nm),
nglass
=
1.5,
nobject = 1 ,
and detch = kO
The etch depth is approximately equal to the exposure wavelength. The phase regions are asymmetric because the 180 deg areas are etched while the 0 deg are not [94-96]. This imbalance causes transmission and phase errors. Figure 9.9 shows the simulated field within an alternating mask using the time-domain finite-difference technique described in §8.5. The incident light is reflected from Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Object variabilities (photomask errors)
175
Figure 9.9: The Odeg and 180 deg phase regions are asymmetric because of mask edge scattering [23]. the chromium areas, resulting in the standing waves. Transmitted fields through the two openings are not identical. The darker color in the unetched 0 deg region on the left compared with the etched 180 deg opening on the right suggests higher transmittance of the 0deg region. Detail analysis also reveals that the phase difference between the regions is not exactly 180 deg, even when the etch depth is that given by Eq. (9.7). The effects of phase and transmission errors on images are demonstrated in Fig. 9.10. For an alternating phase-shifting mask with only transmission but no phase error [Fig. 9.10(a)], the region with lower transmission gives a lower peak intensity for all focus levels. With only phase error [Fig. 9.10(b)], there is no intensity imbalance at nominal focus. The peak intensity of one opening is greater than the other with focus error in one direction, and the difference is reversed when the focus varies in the other direction. The effect of pure phase error is a focusdependent peak intensity difference. With a combination of transmission and phase errors [Fig. 9.10(c)], the peak intensities at best focus are not balanced because of transmission error. The imbalance is not maintained through focus due to phase error. With intensity imbalance, spaces that have the same nominal dimension print with different sizes in positive photoresists. Lines are shifted; but their sizes remain mostly unchanged. The effects are reversed in negative photoresists: resist islands differ in size while clear areas are displaced.
9.3.3 Edge roughness We mean by edge roughness the minute and random undulation of the contours of a photomask feature. By analyzing its spectrum, we can classify roughness into two categories: low-frequency unevenness that is resolved and replicated onto the wafer, and high-frequency coarseness where spatial frequencies are beyond resolution by the exposure configuration in use (on the order of 2NA/X0; see §5.6). Although these high-frequency components are not reproduced directly, they may collectively cause an effective mask dimensional error.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
176
Variabilities
0
k
ó
positive
focus
O
4 O
C N O
C N
r
d
Q)
-0.8 -0.4
0.0
0.4
-0.8 -0.4 0.0
0.8
x (a/NA)
0.4
0.8
x ()/NA)
40.00.40.8
x (AJNA)
Lo n ó
nominal focus
_T mus,
_T
C
'(/1 C
C N
C
W O d
0
m ó
d
C N ó
ó -0.8 -0.4
T
0.0
0.4
0.8 -0.4
0.8
x ()JNA)
x ()/NA)
0.0
0.4
0.8
0.4
0.8
x (AMA) N r ó
_T í1 O7
negative focus
^N O
C N
^N O
. O d N
C N
C N
ó
O
-0.8 -0.4
0.0
0.4
0.8
x ()/NA)
(a) transmission
-0.8 -0.4
0.0
0.4
0.8
-0.8 -0.4
x (A/NA)
(b) phase
0.0
x (2JNA)
(c) both
Figure 9.10: Effects of phase and transmission errors [23].
9.4 Polarization effects Not only does photomask topography cause phase and transmission errors, as demonstrated in §9.3.2, it also changes the degree of polarization of light. Let us begin by studying transmission of linearly-polarized light through an opening, as illustrated in Fig. 9.11(a). The incident light can be polarized with the electric field vibrating parallel or perpendicular to the edge of the opaque layer, which is assumed to be chromium. Figure 9.11(b) plots the fraction of energy transmitted through the opening normalized to its width for both polarizations, where TE (transverse electric) denotes electric field vibration parallel to the chromium edge, and TM (transverse magnetic) represents perpendicular vibration. The TM polarization shows higher transmission for all opening widths; the fractional difference increases with decreasing opening size [97]. Higher transmittance of the TM polarization may be traced to the electromagnetic boundary conditions at the air-chromium interface. Because the absorbing chromium layer is metallic-like, the electric field component parallel to the chromium edge is close to zero, whereas the perpendicular component is not (see
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Illumination
177
0 ^Q
TE O
TM
a,=248 nm
TE O —TM nglass ...:..:.::
C
cs
L
y
0
0
ir_
U
.
c chromium
N cs
0
250
500
750
1000
opening width (nm) (a)
(b)
Figure 9.11: (a) Structure of an opening. (b) Fraction of energy transmitted through the opening normalized to its width for the TE and TM polarizations [97]. Exercise 6.9). Unpolarized light becomes partially TM polarized upon transmission through photomasks. The degree of polarization increases with decreasing opening width. Since images of TM-polarized light have lower contrast than those of the TE polarization (see §6.7), changes in the degree of polarization degrades images by emphasizing the TM component [98]. Despite adversely affecting image quality, photomask-induced partial polarization does not cause inherent variation or fluctuation for unpolarized illumination. Under polarized illumination, however, the degree of polarization of transmitted light is a function of feature orientation. Variability would manifest as orientationdependent image shapes, convolving with the effects of astigmatism (see §7.4).
9.5 Illumination When the effective source intensity distribution is asymmetric with respect to the optic axis, the exposure system suffers from nontelecentricity. Since an asymmetric effective source implies a preponderance of light incident from some off-axis direction, we can understand the fluctuation arising from nontelecentricity by studying imaging with an off-axis point source, as illustrated by the schematics of Fig. 9.12. Imagine an image formed from a bundle of rays centered at the undiffracted direction indicated by the bold arrow in the figure. The lateral position of this center ray shifts with the longitudinal (focus) planes. Nontelecentricity causes image placement error that increases with defocusing.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
178
Variabilities
out of focus^ \i
best focus
Figure 9.12: The center image-forming ray shifts with focus for an off-axis source point [23].
A remedy for such focus-dependent image shift is to maintain source symmetry, such that the image shift corresponding to one source point is compensated by that of the mirror source, resulting in a final image that has no net displacement. This is the rationale behind dipole illumination (see §5.3). One can observe the effective source intensity distribution on the image plane using the method illustrated in Fig. 9.13 [99]. Rays from the light source impinge upon a clear reticle with an opaque spot P on the back. With Köhler illumination, the intensity on the object plane would have been uniform in the absence of the opaque spot. Let us denote this intensity by I. With the opaque spot, the radiance on the object plane diminishes. For example, point B receives light from all parts of the source except for the blocked bundle AP. The intensity distribution on the object plane is Iobject(x,y) = Jo
— ff
J(.f, g) d.fdg,
where S2 is the solid angle that P subtends at (x, y), and f and g define the propagation direction of the light beam AP. For a small opaque spot, Iobject(x,y)
= IO — C J(f, g)
with c being a constant. The relationship between (x, y) and (f, g) is (see Exercise 9.1): nob'ectsin 15
nglass r
_(9.8)
t 2 + r2
where r = x2 +y2 , sine = \/ f2 + g2/f, and t is the thickness of the reticle. If we define Jmax as the maximum intensity of the effective source and Ia n as the corresponding minimum intensity on the object plane, then Irvin = Jo — cJ , and f(f,g) Jmax
Io I(x,y) — lo — 'min —
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Pupil
179
object plane
Figure 9.13: An opaque spot on the back of a clear reticle creates a reverse-tone image of the effective source on the image plane.
The intensity distribution of the effective source is replicated in the object, which is in turn reproduced onto the image plane.
9.6 Pupil Aberration contributes to fluctuation in two ways: image variability caused by aberration itself, and fluctuation resulting from across-field aberration changes. In the presence of coma, for example, the images of a pair of nominally identical spaces described by if d/2 < Ill < 3d/2 tfg (9.9) Óx(z) _ tb g otherwise, are different. This asymmetry changes as the amount of coma varies across the image field.'° Vignetting is another defect that can cause fluctuation. Usually the result of cursory lens design, some image-forming light rays are not captured by the lens when vignetting occurs. The peripheral of the field is especially prone to this defect.
9.7 Focus Focus variation arises from focal plane deviation (variation of Z4 across the image field), astigmatism fluctuation (variation of Z5 and Z6 across the exposure field), wafer topography, mask flatness variation, focus setting and auto-leveling errors, wafer and chuck nonflatness, lens heating, barometric pressure and other environ1 °Brunner [100], Progler and Wheeler [101], and Flagello, et al. [102] provide more discussions on the lithography effects of aberrations.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Variabilities
180
mental variations. A defocused image is blurred, resulting in image quality degradation. Although decreasing X and increasing NA improve resolution of imaging systems, these measures adversely impact depth of focus—the maximum amount of tolerable focus variation. We can estimate how depth of focus scales with 4 and NA using Rayleigh's criterion [103]. Consider the situation shown in Fig. 9.14, where two light rays, one from the center of the pupil and the other from the edge of the aperture, interfere to form a sharp image on the plane zo. This image degrades as the observation plane moves away from zo. At a plane a distance z from zo, the relative phase change between the two rays is 21t
nimage (z — z cos 0obj)
rad.
Rayleigh used the heuristic criterion that the image is sufficiently blurred when the phase change is 90 deg, namely, the optical path difference is a quarter wavelength. According to this criterion, the Rayleigh unit of depth of focus is a distance R. U. such that (see Exercise 9.2) R. U. = -0 1
2 I
1 + 1— sin e 8obi /
4 nimage sin B obt \ 1 1
4 NA • sin S abi
(9.10) 1 — sin2 9abj .
In improving the resolution by reducing X and increasing NA the focus tolerance diminishes. The decrease is linear with wavelength, but depends on the manner in which the numerical aperture changes. Increasing the image space refractive index decreases depth of focus linearly with 1/ni mage , namely, linearly with 1/NA. On the other hand, increasing the semi-aperture angle reduces focus tolerance faster than 1 / sin 2 9 obj, namely, quadratically with 1/NA. When the semi-aperture angle is small, the Rayleigh unit is approximately R. U. =
2 sint 0obj
(9.11)
As sin0 obi approaches one, the Rayleigh unit approaches (X/4) rather than (?/2). The depth of focus is further halved because of the (1 + 1— sin e A obi) factor. Obtaining a larger numerical aperture by increasing nimge impacts depth-of-focus less adversely than increasing sine o b^. °Although defocus is one form of aberrations, we consider it separately because of the varied causes and its historical importance in optical lithography.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Focus
181
pupil ^,' 1 0 1 — 0 2 1= ^/4
eob; nimage
I 01 1 z o z-zo nimage
Figure 9.14: Rayleigh's depth of focus criterion.
Figure 9.15: The focus monitor comprises lines bordered by transmitting regions
that are 90deg out of phase. Similar to the definition of kl for minimum image size [see Eq. (5.27)], we use the k2 factor to denote the depth of focus of a lithography process [104]: depth of focus = k2 R. U. k2
2
for low-NA imaging. 2 sin 9 obj
Focus sensitivity of a process and hence its difficulty increase with decreasing k2. We can measure focus crudely by exposing isolated lines of decreasing dimensions onto a positive photoresist through a series of focus levels [105]. The focus value that prints the narrowest line is the best focus. For finer determination of focus, we can use the focus monitor [106]. The measurement structure consists of a line bordered by two transmitting regions that are 90 deg out of phase, as shown in Fig. 9.15. The 90 deg phase makes the lines Ll and L2 shift in opposite directions with focus changes (see §9.3.2), and the displacement, being insensitive to dose fluctuation, is linear with respect to defocus at small deviations from the diffraction focus. Together with the lines L3 and L4 this structure can be used to characterize focus and astigmatism.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
182
Variabilities
á^ f°o E m
E
x (7/NA)
- f (NA/A)
(a) transmissivity
x (a/NA)
(b) spectrum
(c) approximation
Figure 9.16: (a) A transmission wedge for monitoring dose variation. (b) Its spectrum. (c) Its approximation.
9.8 Dose Dose change can cause both inherent variation and fluctuation. On the one hand, dose nonuniformity arising from wafer topography, wafer stack, and exposure dose variation, including actual dose fluctuation across exposure field or across the slit of step-and-scan exposure systems, and effective dose variation caused by nonuniform hot plate temperature during post-exposure bake, results in fluctuation because of finite exposure latitude. On the other hand, a uniform dose change such as a deliberate dose modification or a dose setting error affects proximity effect because dose sensitivity (exposure latitude) differs among features. The ideal structure for dose monitoring should be sensitive to dose variation but not to other process detractors. One candidate is a transmission wedge described by Ôx(x) — m(1—mpzI)
0
if ^zI < m,m>0, (9.12) otherwise,
with the spectrum (see Exercise 9.4) m2 [1—cos(27tf/m)]. Ó,f (f) = 22f2
(9.13)
The transmission function and spectrum of an example structure with m = 0.2 is shown in Fig. 9.16(a) and (b). Because of the inverse f^ dependence the spectrum is sizable only around f = 0. Concentration of energy in low-spatial-frequency components implies low sensitivity to aberrations including defocusing. But the gradual transmissivity change ensures responsiveness to dose variations.
The transmission function of Eq. (9.12) is difficult to realize in practice. We can approximate the gradation by a structure comprising an array of spaces at a fixed period but decreasing duty ratio from the middle, as illustrated in Fig. 9.16(c) [1071. This pattern is sensitive to actual and effective dose variations including maskdimensional error.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Flare
183
We can determine dose and focus concurrently using the two structures shown in Fig. 9.17 [108]. One of the patterns, shown in Fig. 9.17(a), consists of two gratings situated in a clear background. The grating period p is beyond resolution by the optical metrology system, namely, p
metrology o < 2NAmetrology
The image varies only in the horizontal direction [along AB in Fig. 9.17(a)], as plotted in Fig. 9.17(c); and we can associate with this structure a width w s . An analogous quantity wl, corresponding to two gratings situated in an opaque background [Fig. 9.17(b)], can be similarly determined [Fig. 9.17(d)]. With increasing defocus, the grating features of both structures shorten, causing ws and w1 to increase. Their focus dependence is approximately quadratic around optimum focus, as shown in Fig. 9.17(e). With dose increase, however, the grating features of Fig. 9.17(b) lengthen linearly (on positive photoresists) while those of Fig. 9.17(a) shorten, causing the behavior illustrated in Fig. 9.17(f). Based on the similarity in focus response and difference in dose behavior, we can develop a parametric model such as the following [109]: wti = ao ; + ai 1 E + (a21 + a3 ; E) (z — zo), i E {l, s}, where the coefficients ao 1 ,...,3, and the best focus position zo are parameters determined by experimental calibration. With these parameters, the focus [or defocus (z — zo)] and dose E can be solved for each pair of measurements w s and wl:
E_
—b+ b 2 — 4ac
2a
and
(z — zo) 2 =
wi
— (ao i +a1 E) ;
a2, +a3 E ;
where
a = al,a3, — a3 1 a1, b = a3 1 WS — a3 wt +ao 1 a3 +al 1 a2, — a2 1 al s — a3,ao., c = a2 1 ws —a2swt +ao l a2, —a2 1 aos . S
5
9.9 Flare Also called stray light and scattered light, flare is light existing in the image beyond that captured in the physics described by Eq. (8.1). Figure 9.18 illustrates the primary sources of stray light. Consider image formation of an object point A with the geometrical image point B. Because of diffraction and possible focus error, the intensity distribution around B is blurred. A fraction of this halo is reflected from the wafer toward the reticle and then back to the wafer from the reticle, adding to the original image intensity. Each additional round-trip reflection increases the amount
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
184
Variabilities
Figure 9.17: The similarity in focus response (e) and difference in dose behavior (f) between the structures shown in (a) and (b) enables simultaneous determination of focus and dose.
of stray light, with the halo increasing in size but decreasing in brightness. Depicted by the dotted rays in Fig. 9.18, we can reduce this cause of flare by decreasing the reflectivity of the wafer or the reticle or both. Light scattering from optical elements also contribute to flare. Scattering is caused by surface roughness of optical elements. An ideal optical element has smooth surfaces; it performs its designed function without scattering. For example, a thin lens with smooth surfaces focuses a parallel light beam toward a point on the focal plane within the approximations of geometrical optics (see §2.4). There may be undesired specular reflection if the lens anti-reflection coatings are imperfect;
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Flare
185
wafer and reticle
-
reflection
B A ^ ^ y
, -
------------surface
D
- aberration JC
scattering
Figure 9.18: Primary sources of flare.
but no light is diffused into other directions. However, a physical optical element has (microscopically) rough surfaces that scatter light. The amount of scattering depends on surface roughness and wavelength according to e(4na/X)2 — 1
where a is the standard deviation of the assumed Gaussian-rough surface [110]. Any light (including image-forming beams and rays bouncing between the wafer and the reticle) that impinges onto a surface is scattered; and the amount of scattered light increases with the number of optical elements. Stray light also arises from high-order aberrations. Recall the discussion in §7.4 that the ray aberration generally increases with the Zernike term number, causing the diffraction pattern to spread. If a wavefront contains significant components of high-order aberrations that are not included in the truncated Zernike-series representation of Eq. (7.17), the diverted light contributes to flare. This source is depicted by the light ray ACD in Fig. 9.18. We can surmise that stray light generally increases with the average transmission of the object, and the amount of flare at a particular field point depends on the magnitudes of these sources and their characteristic light spreading distances [111]. Scattered light has the longest range (on the order of the field size), while ray aberration has the shortest (on the order of micrometers). The amount of flare is a function of object pattern density over a large area. By raising the overall image intensity, stray light reduces contrast. But flare variation across the field causes more fluctuation than contrast loss. As an example, assume a feature prints at a nominal dose of Dprint = 1OD0/3, where Do is the dose to clear a large area of positive photoresist. The corresponding normalized intensity level (threshold intensity) is 3/10 = 0.3. If the flare decreases by 3%, the feature would be printing at a normalized intensity level of 0.30 + 0.03 = 0.33 of the original image. This 3% flare level change translates to a 10% dose variation; Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Variabilities
186
the effect of flare variation is amplified. Let us call this effective magnification the flare-amplification factor (FAF): FAF = D^ón
t
Given by the ratio of dose-to-print to dose-to-clear, the flare-amplification factor causes large amounts of fluctuation especially for features with little exposure latitude. By direct addition of light to images, flare causes features to print at a different intensity level, resulting in gross fluctuation. A simple way to measure flare is to expose an opaque object, so large that its nominal interior intensity is zero in the absence of flare, in a sequence of dose steps and observe the dose Ddisappear at which its image formed in positive photoresist completely disappears [Ill]. The ratio of dose to clear D o to Ddisappear is the amount of flare: flare = D0 x 100%. (9.14) Ddisappear
A more detailed characterization of flare can be accomplished by using a large opaque pattern with small openings placed at regular intervals [111], as shown in Fig. 9.19(a). We again expose this test pattern in a sequence of dose steps onto a positive photoresist. As dose increases the edge of the developed photoresist recedes from the designed edge. From the dependence of the resist edge position, determined relative to the small openings, on the exposure dose, we can deduce, using an equation similar to Eq. (9.14), the amount of flare as a function of distance into an opaque region. Figure 9.19(b) shows the result of such a measurement. Three distinct regions are observed. At large distances, the amount of stray light steadies around 3.2%; the background flare (or long-range flare) is 3.2%. At intermediate distances between 1µm and a few tens of micrometers, flare decreases gradually with distance. Sometimes called midrange flare, this behavior is attributable to high-order aberrations [ 112] and wafer-reticle reflections. At distances smaller than 1µm the change in spatial characteristics indicates flare arising from different phenomena. We cannot draw definite conclusions, however, because accuracy of this receding edge method worsens with increasing proximity between the photoresist and geometrical edges due to diffraction and lateral photoresist development effects.
9.10 Remarks We can simulate, with a program such as one developed using the equations of Chapter 8, the effects of most causes of variabilities. Their impact is naturally object-dependent. Even for a specific technology for which only one critical dimension is of primary concern (namely, a single kl factor), image sensitivity can vary greatly among different pattern configurations. Nevertheless, it is possible to
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
187
Remarks
0
CO
0 large opaque pattern
\
mid-range
0 00
CO
O Q0
long-range
0000
V small openings
short-range
Co 4
0
N -
0.1
1.0
10.0
100.0
distance from designed edge (gm) (a) test pattern
(b) measurement results
Figure 9.19: (a) Test pattern for the receding edge method. (b) An example of mea-
surement results [111 ]. estimate a representative sensitivity for each detractor and subsequently a corresponding control requirement. An exercise performed for a kl = 0.35 technology gives the following estimations: Detractor Proximity effect Mask dimensional error Phase error Transmission error Mask edge roughness Degree of polarization Non-telecentricity Aberrations Focus Dose Flare
Control Requirements OPC ±0.008/M (?o/NA) ±2deg 0.005 (Do)
— < 0.05 (for unpolarized illumination) ±0.01 (p) 0.015 (2 rms) ±0.5 (R. U.) [see Eq. (9.10)] ±2% (Dprint) ±0.5% (Do)
These requirements are determined separately for each variability contributor. During actual processing all detractors affect imaging and their net effects are quantified by metrics such as across-chip linewidth variation (ACLU). The manner in which variabilities from various causes combine is a question that often arises in tasks such as estimating variability, developing control requirements, and deciding how the different components of each cause mix together. (An example of the latter task is the determination of how focal plane deviation, wafer topography, mask flatness variation, focus setting error, and wafer nonflatness combine into an overall focus error.) In one extreme we can assume that variabilities caused by all
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Variabilities
188
contributors add linearly. But this conservative and unrealistic assumption results in unnecessarily stringent control requirements. The other extreme assumes that the effects are uncorrelated and random, such that variance of the total across-chip linewidth variation is the sum of the individual detractor variances: t
ótotal =
6i
je {detractors}
This latter assumption is usually adopted in practice. We should beware, however, that effects of independent causes may unwittingly reinforce or cancel one another such that the uncorrelation assumption is violated. For example, the spatial linewidth variation signatures resulting from mask dimensional error, aberrations, and flare fluctuation can have much semblance [113]. In general, individual variabilities and their possible correlation is process dependent. They can be determined only by thorough empirical characterization [113-115]. A well-characterized process would also allow us to perform optical and process correction (OPC), i the technique of predistorting mask patterns to reduce image variability [116-125]. Being a photomask approach, OPC can only remedy inherent variations and systematic and stable fluctuations. We should minimize random variability components for efficacious correction. Even with exact engineering control, however, noise is one random fluctuation that cannot be eliminated. In the context of optical lithography, shot noise refers to the statistical variation in the number of photons used to define a feature. The impact of shot noise depends on the photoresist, the photoacid generation mechanism, and the post-exposure bake and dissolution processes [126-128]. To understand qualitatively the effects of shot noise, consider 193 -nm exposure of a contact hole of area A = 45 nm x 45 nm with a dose of Dp int = 1 mJ cm 2 . The number of photons absorbed Nphotons is approximately h
Nphotons
Dprint X Ithreshold X fabsorbed
xA
Ephoton
where fabsorbed is the fraction of exposure energy deposited into the photoresist, Ithreshold is the threshold intensity of the contact exposure process, and Ephoton is the energy carried by each photon. Assuming the resist absorbs 10% of the exposure energy and Ithreshold = 0.3, 10-3•0.3•0.1•2025 x 10 Nphotons =
-
14
x 10 +$
6.6256 X 10-34. 3193x10- 9 590.
There are only 590 = 24 photons per side, with each corresponding to a substantial 4.1% of the critical dimension. Stochastic variation in the number of photons `''OPC was originally shorthand for optical proximity correction. But the correction has since been applied to compensate for nonoptical processing effects.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Exercises
189
absorbed can cause resist line-edge roughness, resulting in higher variability. Such line-edge roughness can be reduced by increasing the number of absorbed photons, thereupon lowering the photoresist speed.
Exercises 9.1 Derive Eq. (9.8). 9.2 Derive Eq. (9.10). 9.3 In the low-NA limit, what is the Zernike series representation of one Rayleigh unit? 9.4 Derive Eq. (9.13).
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Appendix A
Birefringence The theories and equations explicated in the text were based on two distinct sets of fundamental equations: Maxwell's equations [Eqs. (1.1)—(1.4)], which governs wave propagation; and constitutive relations [Eqs. (1.7)—(1.9)], which describe the interaction of electromagnetic waves with materials. We have assumed in our developments that materials are optically isotropic such that the electric displacement D and electric field E are related by the dielectric constant £e , and the magnetic field H and magnetic induction B are related by the permeability µ,n : D = £ e (w)E
and
Bim(CO)H.
(A.1)
The permittivity and permeability are frequency dependent; however, they can be considered constants since we are concerned with monochromatic waves. The constitutive relations described by Eq. (A. 1) indicate that the electric field is aligned with the displacement, and the magnetic field with the induction. In an anisotropic material, however, these pairs of vectors are not parallel. For electrically anisotropic but magnetically isotropic materials, we can retain the relationship between the magnetic field and induction in Eq. (A. 1), but the simplest connection between the electric field and displacement becomes a linear matrix transformation: D=
Dx
[EJ
£ems £e Xy E e l eyx £ eyy LeyZ Ey
= I£
£ezy £e z ,
= £e E,
EZ
where the nine quantities £ejj constitute the dielectric tensor £e .' Let us investigate consequences of misalignment between the electric field and displacement on wave propagation. For a monochromatic plane wave of angular frequency co, the field phasors are proportional to exp(ice[ --r.s] —t). 'Born and Wolf [129] contains detailed discussions on optical behavior in anisotropic materials.
191
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
192
Birefringence
B H
S
E
E1
'
D Figure A.1: The electric field E and electric displacement D are not parallel in an electrically anisotropic medium.
We can express the partial derivatives of the field vectors as
á
E = —iwwE,
and
all
= — iwH,
V x E = tw—ss x E,
VxH=iwnsxH.
In the absence of currents, Eqs. (1.1) and (1.2) become H=— n 9xE
and
JLmCO
D= n9xH. co .
(A.2)
Substituting the first expression of Eq. (A.2) into the second and using the vector identity Ax(BxC)- (A.C)B—(A•B)C gives n2
n2
n2
m C0
mC0
UmC0
D =— 2 sx(s"xE)= 2 [E—(9 -E)s]= 2 E 1 .
( A.3)
The quantity E1 denotes the vector component of E perpendicular to in the plane formed by E and s, as illustrated in Fig. A. 1. Since, from Eq. (A.2), the vectors H and B are perpendicular to E, D, and s, the latter three vectors are coplanar, with D being orthogonal to "s. The magnetic field H and the electric displacement D are transverse to the normal of the wavefront 9, as in an isotropic medium; but the electric field E is not. To understand the manner in which wave propagates in anisotropic materials, we revisit the expressions for the electric and magnetic energy densities [Eq. (1.14)]
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Birefringence
193
and the Poynting vector [Eq. (1.28)]: 1 2
1 we = 1 E • D = 1 Ei£eijEj,
2i
(A.4)
={x,y,z} j={x,y,z}
w m = ZH•B= 2 IH^ 2 ,
and
S=ExH. Performing mathematical manipulation similar to that leading to Eq. (1.12), we derive that —V.(ExH)=E.-D+H aB at at =
aEj u„
óM1 2
Ei£e `^ 2 + 2 at i ={x,y,z} j={x,y,z}
The term on the left side is the rate of total energy density increase. In the absence of absorption or amplification, it should be the sum of the rate of electric and magnetic energy density changes. For the first term to represent the rate of electric energy density change, it must equal the time derivative of Eq. (A.4): aEj1
aE j aEi
[^
1 Ei£e at = 2 L £e"( Ei —+E —) at at ° i ={x,y,z} j={x,y,z} i ={x,y,z} j={x,y,z}^' 1
meaning 1 aEi
aE j
1
at
i ={x,y,z} j={x,y,z}
£e`' (E —
El
at
=
^+
(EejjEejj)Ei
L
0.
i ={x,y,z} j={x,y,z}
For nonzero electric field components, the above expression is zero if, and only if, £e jj = £ ejj .
The dielectric tensor is symmetric. It has six instead of nine independent components. We can now write the electric energy density as
2
£ E Z £e E? £ E?
We = e 2 x +
Y
+
e
2
+ £eyz EyEZ + £exz ExE2 + Eeg ExE9.
(A.5)
Since the energy density must be greater than or equal to zero, Eq. (A.5) is a semipositive-definitive quadratic form. Spatial points with the same energy density form
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Birefringence
194
an ellipsoid surface. There exists a coordinate system (x',y',z') such that the energy density is We = Eex,Ex +£ey Ey, + Eez E^, where C ex„ E ey„ and Eeg, are the principal dielectric constants (or principal permittivities). In this system of principal dielectric axes, the material equations are simplified: D'
=
E ez, ExI ,
Dyi = Eey, Ey ,
and
DZ' = Cez, Ezi .
(A.6)
Because the principal dielectric constants are distinct for an anisotropic material, the electric field E and displacement D are not parallel unless E coincides with one of the principal axes." Substituting Eq (A.6) into Eq. (A.3) gives, after some straightforward algebra, skEk
n 2 ( g
E )sk2
, — n2 —^um£ekCO
k E {x, y', z' }.
(A.7)
Adding this set of three equations together and dividing the sum by (s • E) results in k={ ',z'}
Subtracting both sides by
n2s2 k um6ekCO
n2_
sk = 1, we obtain k={x' y',z'}
2 2
2 2
(A.8)
f ____OSk v = 2 psk 2 2 2 k={x',Y ,z'} n —,Um£ek CO k={x',y',z'} vk — Vp
where V P = co/n and vx, _
1
vy1 _
1
um£e^
and
vz,
1 ,umEeZ,
are the three principal propagation velocities. Equation (A.8) is a quadratic equation in vp. There are two values of v p for every direction s". With each of the two values of v p , we can solve Eq. (A.7) for the ratios between the electric field components, and subsequently find the electric displacement components using Eq. (A.6). Since the ratios between the field components are real, the electric field and displacement are linearly polarized. We can conclude that an anisotropic medium permits two monochromatic plane waves with two different linear polarizations propagating with two different velocities in any given direction. The electric field also aligns with the displacement if E e,, _ £ey, = se . But this is the isotropic case.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
195
Birefringence
A ramification of having two propagation velocities is a phenomenon known as double refraction or birefringence. When a plane wave incidents from an isotropic medium onto an anisotropic one, there are generally two refracted rays on the plane of incidence, since, according to Snell's law [Eq. (6.22)],
e
Si n int
_ Cinc
sin 8tran Ctran ctr and hence 6tran can assume two values.' In photolithography lenses made of birefringent materials (such as calcium fluoride for 157-nm lithography), each light ray entering an imaging system with N lens elements gives rise to 2N beams in the exit pupil. Such birefringence can be described by a 2 x 2 matrix that characterizes coupling between the different waves [130, 131 ].
"'In addition to intrinsically anisotropic materials, birefringence can be caused by anisotropy induced in nominally isotropic media resulting from mechanical stress, or by an orderly arrangement of particles of isotropic materials. The former phenomenon is called stress birefringence, while the latter is called form birefringence.
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Appendix B
Stationarity and Ergodicity l In computing time averages of optical fields, we regard the ensembles to be stationary and ergodic. Stationarity means that all ensemble averages are independent of the time origin; and ergodicity implies that each ensemble average is equal to the time average involving a typical member of the ensemble. If we denote the field at a point Pi arising from a point source i by Ul (Pi, t), < U(P1,t) >= f Uip(Ui)dUi = 0,
since Ul (Pi, t) can be viewed as a random process on a time scale much longer than the coherence time of the light. Now consider another field generated by a different point source j. Let us denote the field at P2 arising from j by U1 (P2, t). The time average of U, (Pl , t) U1 (P2, t) is +00
< Ui(P1,t)Uu(P2,t) >= ffuuJp(u,uJ)duduj.
Since U (Pl , t) and U^ (Pa, t) are statistically independent, P(U, U.i) = P(Ui)P(U1),
and < Ui(Pi,t)Uj(P2 i t) > = fUp(U)dUiJUjp(Uj)dU1 =< U(P,,t) >< U;(P2 i t) > =0. 'This appendix is contributed by Anthony Yen. 197
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Appendix C
Some Zernike Polynomials j
n
m
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37
0 1 1 2 2 2 3 3 3 3 4 4 4 4 4 5 5 5 5 5 5 6 6 6 6 6 6 6 7 7 7 7 7 7 7 7 8
0 1 1 0 2 2 1 1 3 3 0 2 2 4 4 1 1 3 3 5 5 0 2 2 4 4 6 6 1 1 3 3 5 5 7 7 0
an
R,`, (0)YJ (0)
vÏ \ V6 \
v
V 10 10 10
viö 12 12 12 12 VÏ2 12 14 14 14 14 14 16 16 16 16 16 16 VÎ 6 16
(+1) (+p)cosrp (+p)sin4 (+2 2 _1) (+p 2 )sin24 (+p 2 )cos24 (+3p 3 -25)sin4 (+3p3 — 2p)cos4 (+p3)sin34 (+53)cos34 (+6p 4 -65 2 +1) (+40 4 - 3p 2 )cos2¢ (+40 4 — 3p 2 )sin24 (+p 4 )cos44 (+p 4 )sin44 (+10p 5 —12pá +3p)cos4 (+100 5 —120 3 +3p)sin4 (+50 5 -4p 3 )cos3c (+50 5 —4p 3 )sin30 (+p 5 )cos54 (+p 5 )sin54 (+2006 — 3004 + 120 2 — 1) (+150 —200 4 +6p 2 )sin20 (+150 6 — 200 4 + 6p 2 )cos 20 (+60 6 - 5p 4 )sin44 (+60 6 — 5p 4 )cos 44 (+p6)sin64 (+p )cos64 (+350 7 — 600 5 + 305 3 — 4p)sin 4 (+350 7 —600 5 +300 3 —4p)cos4 (+210 7 — 300 5 + 10p 3 )sin34 (+210 7 — 300 5 + 100 3 )cos 34 (+70 7 - 615 5 )sin50 (+70 7 - 60 5 )cos54 (+p 7 )sin7m (+p 7 )cos74 (+7008-14006+9004-2002+1)
199
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Some Zernike Polynomials
200
JÎ8
38 39 40 41 42 43 44 45 46 47 48 49 50
8 8 8 8 8 8 8 8 9 9 9 9 9
2 2 4 4 6 6 8 8 1 1 3 3 5
51
9
5
52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93
9 9 9 9 10 10 10 10 10 10 10 10 10 10 10 11 11 11 11 11 11 11 11 11 11 11 11 12 12 12 12 12 12 12 12 12 12 12 12 12 13 13
7 7 9 9 0 2 2 4 4 6 6 8 8 10 10 1 1 3 3 5 5 7 7 9 9 11 11 0 2 2 4 4 6 6 8 8
10 10 12 12
1 1
18 iT8 18 VÎ 8 18 18 18 20 20 20 20 20 20 20 20 20 20 11 22 22 22 22 22 22 22 22 22 22 24 24 24 24 24 24 24 24 24 24 Vi 24 VT3 26 v26 26 26 26 26 26 26 26 26 26 26 28 28
(+560 8 —10515 6 +600 4 —10p 2 )cos24 (+5615 8 —1050 6 +6004 —10p 2 )sin 2cß (+2815 8 - 420 6 +1515 4 )cos40 (+280 8 — 420 6 + 15p 4 )sin44 (+80 8 - 715 6 )cos6c (+80 8 —7p 6 )sin60 (+15 8 )cos80 (+p 8 )sin84 (+1260 9 -2800 7 +2100 5 - 600 3 +5p)cos4 (+1260 9 - 2800 7 +2100 5 - 600 3 +5p)sin0 (+840 9 —1680 7 + 1050 5 — 200 3 )cos 34 (+840 9 —1680 7 + 1050 5 — 200 3 ) sin 30 (+360 9 - 560 7 +2115 5 )cos54 (+360 9 - 560 7 +2115 5 )sin5 0 (+90 9 — 8p 7 )cos74 (+90 9 -80 7 )sin70 (+15 9 )cos90 (+15 9 )sin90 (+2520 10 — 6300 8 + 5600 6 — 2100 4 + 300 2 —1) (+2100 10 —5040 8 +4200 6 —1400 4 + 15p 2 )sin24 (+2100 10 — 50415 8 +4200 6 —1400 4 + 15p 2 )cos 20 (+1200 10 - 2520 8 +1680 6 - 35p 4 )sin44 (+ 1200 10 — 25215 8 + 1680 6 — 350 4 )cos 40 (+450 10 - 720 8 +28p 6 )sin60 (+45 0 10 — 720 8 + 28p 6 )cos 60 (+10p 10 - 90 8 )sin80 (+10p 10 - 90 8 )cos80 (+p 10 )sinl04 (+15 10 )cos 100 (+4620 11 12600 9 + 126015 7 — 5600 5 + 1051 3 — 615) sin 0 (+4620 11 -12600 9 + 12600 7 — 5600 5 + 1050 3 — 6p)cos 4 (+3300 11 — 8401 9 + 75615 7 — 2800 5 + 350 3 ) sin 34 (+3300 11 -84015 9 +75615 7 -28015 5 +35p 3 )cos30 (+1650 11 — 3600 9 + 2520 7 — 560 5 ) sin 50 (+1650 11 3600 9 +2520 7 - 56p 5 )cos51 (+55p 11 — 900 9 + 360 7 ) sin 70 (+550 11 — 900 9 + 36 5 7 ) cos 70 (+110 11 100 9 )sin90 (+110 11 1015 9 )cos94 +p l l )sin 114 (+P 11 )cos 110 (+9240 12 — 27720 10 + 31500 8 —168015 6 +42015 4 _4215 2 +1) -
-
-
-
(
(+7920 12 — 23100 10 + 25200 8 —12600 6 +2805 — 210 2 )cos 20 (+7920 12 — 23100 10 + 25200 8 —12600 6 + 2800 4 — 21 15 2 )sin 21 (+49515 12 -13200 10 +12600 8 -5040 6 +7015 4 )cos4 0 (+4950 12 —132015 10 + 126015 8 — 5040 6 + 700 4 ) sin 40 (+2200 12 — 4950 10 + 3600$ — 840 6 )cos 60 (+2200 12 - 4950 1 0 +360p 8 - 8415 6 )s1n64 (+660 12 -110p 10 + 45p 8 )cos80 (+660 12 -110p 10 + 450 8 )sin80 (+120 12 —11p 10 )cos 100 (+ 120 12 —11p 10 )sin 100 (+15 12 )cos 120 (+0 12 )sin 120 + 12600 5 —1680 3 + 7p)cos 0 (+17160 13 — 55440 11 + 69300 9 —420015+ (+17160 13 — 554415 11 + 69300 9 — 42000 7 + 12600 5 —16ßp 3 + 715) sin 0
Downloaded from SPIE Digital Library on 25 Feb 2010 to 130.60.68.45. Terms of Use: http://spiedl.org/terms
Some Zernike Polynomials
94 13 3 95 13 3 96 13 5 97 13 5 98 13 7 99 13 7 100 13 9 101 13 9 102 13 11 103 13 11 104 13 13 105 13 13 106 14 0 107 14 2 108 14 2 109 14 4 110 14 4 111 14 6 112 14 6 113 14 8 114 14 8 115 14 10 116 14 10 117 14 12 118 14 12 119 14 14 120 14 14 121 15 1 122 15 1 123 15 3 124 15 3 125 15 5 126 15 5 127 15 7 128 15 7 129 15 9 130 15 9 131 15 11 132 15 11 133 15 13 134 15 13 135 15 15 136 15 15
201
28 (+12870 1 3 -39600"+46200 9 -25200 7 +630p á -560 3 )cos30 28 (+12870 13 —3960" +46200 9 —25200 7 +6300 5 — 563 3 )sin34 28 (+7150 13 —19800" + 19800 9 — 8400 7 + 1260 5 )cos 50 28 (+71áp 1 ' — 19800" + 19800 9 — 8400 7 + 126p 5 )sin50 28 (+2860 1 ^ — 6600" +4950 9 —1200 7 )cos 70 28 (+2860 13 -6600 11 +4950 9 -120p 7 )sin70 28 (+780" 1 3 -1325 11 +55p 9 )cos94 28 (+780 13 —1320" +55p 9 )sin94 28 (+130" 13 —120")cos 110 28 (+130 1 3 — 120 11 )sin 114 28 (+p' 3 )cos 134 28 (+p 13 )sin 130 vTi5 (+34320 14 -120120 12 +166320 10 -115500 8 +42000 6 -7560 4 +560 2 -1) 30 (+30030 14 — 1029615 12 + 138600 10 — 92400 8 + 31500 6 —50415 —5040 4 +28 0 2 ) sin 20 30 (+30030 34 — 102960 12 + 1386015 10 — 92400 8 + 31500 6 — 5ß4p 4 + 28p z )cos 24 30 (+20020 14 — 64350 12 +792015 10 — 46200 8 + 12600 6 — 12615 4 )sin40 30 (+20020 14 -64350 12 +79205 10 -46200 8 +12600 6 -126p 4 )cos441 30 (+ 100115 14 -28600' 2 +29700 10 -13200 8 +2101 )sin60 30 (+10010 14 -28600' 2 +297015 10 -13200 8 +21015 6 )cos641 30 (+3640 14 — 8580 12 + 66015 10 —1650 8 )sin 80 30 (+36415 14 — 8580 12 + 6600 10 —16515 8 )cos 8i 30 (+910 14 -1560 12 +660 10 )sin100 30 (+910 14 —1560' 2 +66p 10 )cos 100 30 (+140 14 -1315' 2 )sin120 30 (+140 14 -1315' 2 )c0s120 30 (+p 14 )sin140 30 (+15 14 )cos140 32 (+64350 15 _2402415 13 +3603615h 1 — 277201 9 + 1155007 —2520i5 +25215 — 8p)sin 0 32 (+64350 15 — 240240 13 +360361511 —2772015 0 + 1155015v —252015 +25215 — 815)cos o 32 (+500515 15 — 180180 13 +25740151 —184800 9 +693015 v — 12601 5 + 8415 3 )sin 341 32 (+50050 15 — 180180 13 + 257400 11 — 184800 9 + 69300 7 - 12600 5 + 840 3 )cos 30 32 (+30030 15 —100100 13 + 128700 11 — 79200 9 +231015 7 — 25215 5 )sin50 32 (+30ß3p 15 — 100100 13 + 128700 11 —792015 +23100 7 — 252p 5 )cos50 32 (+13650 15 -40040' 3 +429ßp"-19800 9 +33013 7 )sin74 32 (+136515 15 -40040 13 +42900 11 -19800 9 +330p 7 )cos7, 32 (+45515 15 -10920 13 +8580"-22015 9 )sin94 32 (+45515 15 — 109215 13 + 8580 11 — 2200 9 )cos 90 32 (+10515' 5 -1820 13 +7815")sin110 32 (+ 1050' 5 —1825 13 +780")cos 110 32 (+150 1 5 —1415 1 3 )sin 130 32 (+ 1515 15 — 14p 13 )cos 134 32 (+15 15 )sin 150 32 (+15 15 )cos 154
Appendix D
Simulator Accuracy Tests

It is often desirable to assess the accuracy of an imaging simulation program. A test suite for fidelity evaluation should be comprehensive, and the cases should admit analytic solutions. In this appendix we construct a set of tests, based on the physics encapsulated in Eq. (8.1), that can be used to gauge the accuracy of simulation programs.
D.1 Blank mask

For a mask that is completely transmitting,

\hat O(\hat x, \hat y) = 1

and

\tilde O(\hat f, \hat g) = \delta(\hat f)\,\delta(\hat g).
Because of the obliquity factor (see §4.2), the coupling image intensity, although constant, is not necessarily unity: +00
\hat I(\hat x, \hat y, z) = \int\!\!\int_{-\infty}^{+\infty} \tilde J(\hat f, \hat g)\,
\sqrt{\frac{1 - n_{image}^2 M^2 \hat\rho_e^2}{1 - \hat\rho_e^2}}\;
\mathrm{d}\hat f\, \mathrm{d}\hat g .
For radially symmetric illumination described by

\tilde J(\hat f, \hat g) =
\begin{cases}
\dfrac{1}{\pi\left(\hat\sigma_{outer}^2 - \hat\sigma_{inner}^2\right)} & \text{if } \hat\sigma_{inner} \le \hat\rho \le \hat\sigma_{outer}, \\
0 & \text{otherwise},
\end{cases}
the intensity is

I = \frac{2}{\sin^2\theta_{obj}\left(\hat\sigma_{outer}^2 - \hat\sigma_{inner}^2\right)}
    \int_{\hat\sigma_{inner}\sin\theta_{obj}}^{\hat\sigma_{outer}\sin\theta_{obj}}
    \sqrt{\frac{1 - n_{image}^2 M^2 \hat\rho_e^2}{1 - \hat\rho_e^2}}\;
    \hat\rho_e\, \mathrm{d}\hat\rho_e
  = \frac{n_{image} M}{\sin^2\theta_{obj}\left(\hat\sigma_{outer}^2 - \hat\sigma_{inner}^2\right)}
    \int_{1 - \hat\sigma_{outer}^2\sin^2\theta_{obj}}^{1 - \hat\sigma_{inner}^2\sin^2\theta_{obj}}
    \sqrt{1 + \frac{a}{x}}\; \mathrm{d}x,
where x = 1 − ρ̂_e² and a = 1/(n_{image}² M²) − 1. The indefinite integral is analytic:

\int \sqrt{1 + \frac{a}{x}}\, \mathrm{d}x = z + \frac{a}{2}\ln\!\left(\frac{a}{2} + x + z\right),
\qquad \text{with } z = \sqrt{x(a + x)}.

The intensity is

I = \frac{n_{image} M}{\sin^2\theta_{obj}\left(\hat\sigma_{outer}^2 - \hat\sigma_{inner}^2\right)}
    \left[\, z + \frac{a}{2}\ln\!\left(\frac{a}{2} + x + z\right) \right]_{1 - \hat\sigma_{outer}^2\sin^2\theta_{obj}}^{1 - \hat\sigma_{inner}^2\sin^2\theta_{obj}}.
Table D.1 tabulates the intensities of various imaging systems.

Table D.1: Coupling image intensities of a 100% transmitting mask imaged by various systems.

n_image   σ̂_inner sinθ_obj   σ̂_outer sinθ_obj   M = 1          M = 0.25       M = 0.2
1         0.0                 0.0                 1              1              1
1         0.0                 0.2                 1              1.0095713537   1.0097995793
1         0.0                 0.4                 1              1.0409095183   1.0418649964
1         0.0                 0.6                 1              1.1046046472   1.1069527619
1         0.0                 0.8                 1              1.2363625785   1.2412943339
1         0.2                 0.4                 1              1.0513555731   1.0525534688
1         0.2                 0.6                 1              1.1164838089   1.1190969098
1         0.2                 0.8                 1              1.2514819934   1.2567273175
1         0.4                 0.6                 1              1.1555607504   1.1590229744
1         0.4                 0.8                 1              1.3015135985   1.3077707797
1         0.6                 0.8                 1              1.4057656329   1.4140192121
1.5       0.0                 0.0                 1              1              1
1.5       0.0                 0.2                 0.9870433935   1.0087783694   1.0092923179
1.5       0.0                 0.4                 0.9415567288   1.0375828099   1.0397401291
1.5       0.0                 0.6                 0.8274118142   1.0963999155   1.1017257759
1.5       0.0                 0.8                 -              1.2190366429   1.2303000474
1.5       0.2                 0.4                 0.9263945072   1.0471842901   1.0498893995
1.5       0.2                 0.6                 0.8074578668   1.1073526087   1.1132799582
1.5       0.2                 0.8                 -              1.2330538611   1.2450338960
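The closed-form expression above is straightforward to evaluate directly. The following minimal Python sketch (the function and variable names are illustrative, not from the text) reproduces representative entries of Table D.1 and can serve as a reference-value generator when exercising a simulator with a blank mask.

```python
import math

def blank_mask_intensity(n_image, M, s_inner, s_outer):
    """Closed-form coupling intensity of a 100% transmitting mask.

    s_inner and s_outer are sigma_inner*sin(theta_obj) and sigma_outer*sin(theta_obj).
    Uses x = 1 - rho_e^2, a = 1/(n_image*M)^2 - 1, z = sqrt(x*(a + x)) and the
    antiderivative z + (a/2)*ln(a/2 + x + z) quoted above.
    """
    a = 1.0 / (n_image * M) ** 2 - 1.0

    def antiderivative(x):
        z = math.sqrt(x * (a + x))
        return z + 0.5 * a * math.log(0.5 * a + x + z)

    upper = antiderivative(1.0 - s_inner ** 2)
    lower = antiderivative(1.0 - s_outer ** 2)
    return n_image * M * (upper - lower) / (s_outer ** 2 - s_inner ** 2)

# Spot checks against Table D.1:
print(blank_mask_intensity(1.0, 0.25, 0.0, 0.2))   # ~1.0095713537
print(blank_mask_intensity(1.5, 1.00, 0.2, 0.4))   # ~0.9263945072
```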
D.2 Images of M = 1 systems

For aberration-free unity-magnification systems in which the object- and image-space media are the same, the scalar coupling image is given by a simplified form of Eq. (8.3):

\hat I(\hat x, \hat y, z) = \sum_{n'',m''} \sum_{n',m'} \tilde O_{m'n'}\, \tilde O_{m''n''}^{*}\,
e^{-i2\pi\left[(m'-m'')\hat x/\hat p_x + (n'-n'')\hat y/\hat p_y\right]}\,
\mathrm{TCC}_{m'n';\,m''n''},
where

\mathrm{TCC}_{m'n';\,m''n''} = \int\!\!\int_{-\infty}^{+\infty} \tilde J(\hat f, \hat g)\,
\mathrm{circ}\!\left(\frac{m'}{\hat p_x}, \frac{n'}{\hat p_y}, 1\right)
\mathrm{circ}\!\left(\frac{m''}{\hat p_x}, \frac{n''}{\hat p_y}, 1\right)
\mathrm{d}\hat f\, \mathrm{d}\hat g
\tag{D.1}
and

\mathrm{circ}(f_0, g_0, \sigma) =
\begin{cases}
1 & \text{if } \sqrt{(f - f_0)^2 + (g - g_0)^2} \le \sigma, \\
0 & \text{otherwise}
\end{cases}
is the circle function. For effective sources of uniform intensity, each transmission cross-coefficient is proportional to the overlapping area of the effective source and the two displaced pupils. The integral of Eq. (D.1) can be computed analytically [132,133]. With circular sources, for example, we can determine the overlapping area of three circles by geometry or by using Stokes' theorem [134] (see Exercise 8.4). In the following subsections we present aerial images of representative objects produced by various optical systems.
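Before turning to the examples, note that Eq. (D.1) can also be evaluated by brute force, which provides a convenient cross-check on an analytic overlap-area implementation. A minimal sketch follows, assuming a uniform circular effective source of radius σ and the normalized unit-radius pupil used in this section; the function and parameter names are illustrative.

```python
import numpy as np

def tcc(m1, n1, m2, n2, p_x, p_y, sigma, samples=1001):
    """Brute-force TCC_{m'n';m''n''} of Eq. (D.1) for a uniform circular source.

    The integrand is sampled on a grid covering the source disk of radius sigma;
    circ(f0, g0, s) is 1 inside the circle of radius s centred at (f0, g0).
    The result is normalized so that TCC_{00;00} = 1 when the source lies
    entirely inside the pupil.
    """
    f = np.linspace(-sigma, sigma, samples)
    F, G = np.meshgrid(f, f)
    source = F ** 2 + G ** 2 <= sigma ** 2
    pupil1 = (F - m1 / p_x) ** 2 + (G - n1 / p_y) ** 2 <= 1.0
    pupil2 = (F - m2 / p_x) ** 2 + (G - n2 / p_y) ** 2 <= 1.0
    return np.count_nonzero(source & pupil1 & pupil2) / np.count_nonzero(source)

# Example: zeroth/first-order cross-coefficient for the line-space of Section D.2.1.
print(tcc(1, 0, 0, 0, p_x=1.5, p_y=1.5, sigma=0.5))
```

Grid sampling converges slowly near the circle boundaries, so a fine grid (or the analytic overlap formula) is preferable when high-accuracy reference values are needed.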
D.2.1 Chromium-on-glass mask under on-axis illumination

Object and imaging configuration:

\hat O(\hat x) =
\begin{cases}
t_{fg} & \text{if } |\hat x - n\hat p_x| \le \hat d/2,\ n \in \mathbb{Z},\\
t_{bg} & \text{otherwise.}
\end{cases}

Imaging parameters: λ_o = 500 nm; NA = 0.5; J̃(f̂, ĝ) = circ(0, 0, 0.5).
Object parameters: p̂_x = 1.5; d̂ = 0.5; t_fg = 1; t_bg = 0.

The image and select intensity values are shown in Fig. D.1.
[Plot for Fig. D.1: intensity versus position (λ_o/NA), −0.50 to 0.50.]

position (λ_o/NA)    intensity
0.00                 0.702211
0.05                 0.684056
0.10                 0.631798
0.15                 0.551715
0.20                 0.453194
0.25                 0.347301
0.30                 0.245127
0.35                 0.156188
0.40                 0.087137
0.45                 0.040992
0.50                 0.017003

Figure D.1: Coupling intensity distribution of a line-space on a chromium-on-glass mask under partially coherent illumination.
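Images such as Fig. D.1 can be generated by an Abbe (source-point) summation: the effective source is discretized, a coherent image is formed for each source point from the diffraction orders that fall inside the unit-radius pupil, and the results are averaged. A compact sketch under the scalar, M = 1 assumptions of this section follows; the helper names and the source discretization are illustrative, not the book's implementation, and with a sufficiently fine source sampling the values approach those tabulated above.

```python
import numpy as np

def line_space_orders(p_x, d, t_fg, t_bg, m_max=20):
    """Fourier coefficients O_m of a line-space of period p_x with transmission
    t_fg over a width d in each period and t_bg elsewhere (normalized units)."""
    m = np.arange(-m_max, m_max + 1)
    duty = d / p_x
    coeff = ((t_fg - t_bg) * duty * np.sinc(m * duty)).astype(complex)
    coeff[m == 0] += t_bg
    return m, coeff

def abbe_image(x, m, coeff, p_x, source_points):
    """Partially coherent image by averaging coherent images over source points.
    An order m launched from source point (fs, gs) is kept when it falls inside
    the unit-radius pupil; x and p_x are in units of lambda_o/NA."""
    intensity = np.zeros_like(x, dtype=float)
    for fs, gs in source_points:
        keep = (fs + m / p_x) ** 2 + gs ** 2 <= 1.0
        field = (coeff[keep][:, None] *
                 np.exp(2j * np.pi * np.outer(m[keep], x) / p_x)).sum(axis=0)
        intensity += np.abs(field) ** 2
    return intensity / len(source_points)

# Chromium-on-glass line-space of Section D.2.1 under circ(0, 0, 0.5) illumination.
m, coeff = line_space_orders(p_x=1.5, d=0.5, t_fg=1.0, t_bg=0.0)
s = np.linspace(-0.5, 0.5, 51)
FS, GS = np.meshgrid(s, s)
inside = FS ** 2 + GS ** 2 <= 0.5 ** 2
source_points = list(zip(FS[inside], GS[inside]))
x = np.linspace(0.0, 0.5, 11)
print(abbe_image(x, m, coeff, 1.5, source_points))
```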
D.2.2 Dipole illumination of attenuated phase-shifting mask

Object and imaging configuration:

Object: the attenuated phase-shifting line-space of Eq. (5.5).

Imaging parameters: λ_o = 500 nm; NA = 0.5; J̃(f̂, ĝ) = circ(±1/(2p̂_x), 0, 0.2).
Object parameters: p̂_x = 0.8; d̂ = 0.4; t_fg = 1; t_bg = −(π − 2)/(π + 2).
The image and select intensity values are shown in Fig. D.2.

[Plot for Fig. D.2: intensity versus position (λ_o/NA), −0.4 to 0.4.]

position (λ_o/NA)    intensity
0.00                 0.605236
0.05                 0.582200
0.10                 0.516601
0.15                 0.418425
0.20                 0.302618
0.25                 0.186811
0.30                 0.088635
0.35                 0.023035
0.40                 0.000000

Figure D.2: Coupling intensity distribution of a line-space on an attenuated phase-shifting mask under dipole illumination.
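With the background transmission −(π − 2)/(π + 2) used above, the 0th- and 1st-order amplitudes of the line-space are equal, and every point of either dipole pole passes exactly those two orders, so each pole produces a two-beam image with a perfect null. The following short check of that closed form against Fig. D.2 is a sketch, not the book's code.

```python
import math

# Attenuated PSM of Section D.2.2: p_x = 0.8, d = 0.4, t_fg = 1, t_bg = -(pi - 2)/(pi + 2).
p_x, d, t_fg = 0.8, 0.4, 1.0
t_bg = -(math.pi - 2.0) / (math.pi + 2.0)

duty = d / p_x                                            # 0.5 (equal line-space)
a0 = t_bg + (t_fg - t_bg) * duty                          # 0th-order amplitude
a1 = (t_fg - t_bg) * duty * math.sin(math.pi * duty) / (math.pi * duty)  # 1st order

# Each pole at +/-1/(2 p_x) passes only the 0th and one 1st order, so
# I(x) = a0^2 + a1^2 + 2 a0 a1 cos(2 pi x / p_x).
for x in (0.0, 0.05, 0.40):
    I = a0 ** 2 + a1 ** 2 + 2 * a0 * a1 * math.cos(2 * math.pi * x / p_x)
    print(x, I)        # ~0.605236, ~0.582200, ~0.000000 (cf. Fig. D.2)
```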
D.2.3 Equal line-space on alternating phase-shifting mask

Object and imaging configuration:

\hat O(\hat x) =
\begin{cases}
1 & \text{if } |\hat x - 2n\hat p_x| \le (\hat p_x - \hat d)/2,\\
-1 & \text{if } (\hat p_x + \hat d)/2 \le |\hat x - 2n\hat p_x| \le (3\hat p_x - \hat d)/2, \qquad n \in \mathbb{Z},\\
0 & \text{otherwise.}
\end{cases}
\tag{5.22}

Imaging parameters: λ_o = 500 nm; NA = 0.5; J̃(f̂, ĝ) = δ(f̂)δ(ĝ) and circ(0, 0, 0.3); z = 0 nm.
Object parameters: p̂_x = 0.5; d̂ = 0.25.

This is the scenario investigated in Exercise 5.8. The images and select intensity values under the two illumination configurations are shown in Fig. D.3.
[Plot for Fig. D.3: intensity versus position (λ_o/NA), −0.50 to 0.50, for σ = 0 and σ = 0.3.]

position (λ_o/NA)    σ = 0       σ = 0.3
0.00                 0.810569    0.189713
0.05                 0.733167    0.189713
0.10                 0.530525    0.189713
0.15                 0.280045    0.189713
0.20                 0.077403    0.189713
0.25                 0.000000    0.189713

Figure D.3: Coupling intensity distributions of an equal line-space on an alternating phase-shifting mask under coherent and partially coherent illumination.
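The σ = 0 column is two-beam interference between the ±1st orders (the 0th order vanishes for this equal line-space alternating pattern), giving a peak of 8/π² ≈ 0.8106; for σ = 0.3 essentially every source point passes only one of the two first orders, so the image is the constant level shown. A small check under these assumptions follows (illustrative code, not from the text).

```python
import math

p_x, d = 0.5, 0.25
T = 2 * p_x                            # amplitude period of the alternating PSM
w = p_x - d                            # opening width (equal lines and spaces)
a1 = 2 * (w / T) * math.sin(math.pi * w / T) / (math.pi * w / T)   # +/-1st-order amplitude

# Coherent illumination: two-beam interference of the +/-1st orders.
print("sigma = 0 peak:", 4 * a1 ** 2)  # 8/pi^2 ~ 0.810569

# sigma = 0.3: only one first order passes the unit pupil for (almost) every
# source point; the passing fraction is the overlap of the source disk with a
# unit-radius disk displaced by the order spacing 1/T = 1.
def disk_overlap(r1, r2, dist):
    """Area of intersection of two disks of radii r1, r2 with centre distance dist."""
    if dist >= r1 + r2:
        return 0.0
    if dist <= abs(r1 - r2):
        return math.pi * min(r1, r2) ** 2
    alpha = math.acos((dist ** 2 + r1 ** 2 - r2 ** 2) / (2 * dist * r1))
    beta = math.acos((dist ** 2 + r2 ** 2 - r1 ** 2) / (2 * dist * r2))
    return (r1 ** 2 * alpha + r2 ** 2 * beta
            - 0.5 * math.sqrt((-dist + r1 + r2) * (dist + r1 - r2)
                              * (dist - r1 + r2) * (dist + r1 + r2)))

s = 0.3
frac = 2 * disk_overlap(s, 1.0, 1.0) / (math.pi * s ** 2)
print("sigma = 0.3 level:", a1 ** 2 * frac)   # ~0.189713
```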
D.2.4 Periodic contacts on chromium-on-glass mask Object and imaging configuration: tfg if
1z-mpx ^