To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???揖??幽???λ9有??猷??筌??? 0011111100111111001111111001011101001011001111110011111110010111010010000011111100111111001111111000001111001001100000100101100010010111010011000011111100111111100101110101000100111111001111111110001010100011001111110011111100111111 3f3f3f974b3f3f97483f3f3f83c98258974c3f3f97513f3fe2a33f3f3f
EUC-JP ???揖??幽???λ9有??猷??筌??? 0011111100111111001111111100110110101100001111110011111111001101101010010011111100111111001111111010011011001011101000111011100111001101101011010011111100111111110011011011001000111111001111111110010010100101001111110011111100111111 3f3f3fcdac3f3fcda93f3f3fa6cba3b9cdad3f3fcdb23f3fe4a53f3f3f
UTF-8 捻뀁늿揖루댆幽껊짎若λ9有곻쬅猷뱀삏筌뉎끃劉 1110111110100110101001001110101110000000100000011110101110001010101111111110011010001111100101101110101110100011101010001110101110001100100001101110010110111001101111011110101010111011100010101110110010100111100011101110111110100101101101001100111010111011111011111011110010011001111001101001110010001001111010101011001110111011111011001010110010000101111001111000110010110111111010111011000110000000111011001000001010001111111001111010110110001100111010111000100110001110111010111000000110000011111011111010011110000111 efa6a4eb8081eb8abfe68f96eba3a8eb8c86e5b9bdeabb8aeca78eefa5b4cebbefbc99e69c89eab3bbecac85e78cb7ebb180ec828fe7ad8ceb898eeb8183efa787
UHC 捻뀁늿揖루댆幽껊짎若λ9有곻쬅猷뱀삏筌뉎끃劉 1110011011110111101100101110110010001000100010001110101111100111101101111110011110001000101100001110101011101011100000111110101110100011100110101110010110101110101001011110101110100011101110011110101011110011100000011110111110100110100111001110101110100011101110011110110010011000100101101110111110100111100001111110001110000101101110011110101011100101 e6f7b2ec8888ebe7b7e788b0eaeb83eba39ae5aea5eba3b9eaf381efa69ceba3b9ec9896efa787e385b9eae5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)