To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????h???????韋 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000001010001000001111110011111100111111001111110011111100111111001111111110100011101000 3f3f3f3f3f3f3f3f3f3f3f3f3f82883f3f3f3f3f3f3fe8e8
EUC-JP ?????????????h???????韋 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010001111101000001111110011111100111111001111110011111100111111001111111111000011101010 3f3f3f3f3f3f3f3f3f3f3f3f3fa3e83f3f3f3f3f3f3ff0ea
UTF-8 溜삳젒溜삳젚溜븐뵿溜믩졋留h쵗溜묐졎溜뺣졋韋 111011111010011110001011111011001000001010110011111011001010000010010010111011111010011110001011111011001000001010110011111011001010000010011010111011111010011110001011111010111011100010010000111010111011010110111111111011111010011110001011111010111010111110101001111011001010000110001011111011111010011110001101111011111011110110001000111011001011010110010111111011111010011110001011111010111010110010010000111011001010000110001110111011111010011110001011111010111011101010100011111011001010000110001011111010011001111110001011 efa78bec82b3eca092efa78bec82b3eca09aefa78bebb890ebb5bfefa78bebafa9eca18befa78defbd88ecb597efa78bebac90eca18eefa78bebbaa3eca18be99f8b
UHC 溜삳젒溜삳젚溜븐뵿溜믩졋留h쵗溜묐졎溜뺣졋韋 1110101011111110101110111110101110100000100100011110101011111110101110111110101110100000100101101110101011111110101110101110110010010100101111011110101011111110100100101110101110100000101110101110101110100111101000111110100010101100100110011110101011111110100100011110101110100000101110111110101011111110100101011110101110100000101110101110101011011111 eafebbeba091eafebbeba096eafebaec94bdeafe92eba0baeba7a3e8ac99eafe91eba0bbeafe95eba0baeadf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)