Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???節??泳??譯??鴉??秧??嚥?? 001111110011111100111111100100001101111100111111001111111000100101101010001111110011111111100110101000010011111100111111111010011110101100111111001111111110001001011110001111110011111110011010100010110011111100111111 3f3f3f90df3f3f896a3f3fe6a13f3fe9eb3f3fe25e3f3f9a8b3f3f
EUC-JP 縕??節??泳??譯??鴉??秧??嚥?? 1000111111010100110000100011111100111111110000001110000100111111001111111011000111001011001111110011111111101100101000110011111100111111111100101110110100111111001111111110001110111111001111110011111111010011111010110011111100111111 8fd4c23f3fc0e13f3fb1cb3f3feca33f3ff2ed3f3fe3bf3f3fd3eb3f3f
UTF-8 縕귚폏節얍뜝泳졿옅譯볩슛鴉싷슭秧좂텭嚥뜻뱮 111001111011100010010101111010101011011110011010111011011000111110001111111001111010111110000000111011001001011010001101111010111001110010011101111001101011001110110011111011001010000110111111111011001001100010000101111010001010110110101111111010111011001110101001111011001000101010011011111010011011010010001001111011001000101110110111111011001000101010101101111001111010011110100111111011001010001010000010111011011000010110101101111001011001101010100101111010111001110010111011111010111011000110101110 e7b895eab79aed8f8fe7af80ec968deb9c9de6b3b3eca1bfec9885e8adafebb3a9ec8a9be9b489ec8bb7ec8aade7a7a7eca282ed85ade59aa5eb9cbbebb1ae
UHC 縕귚폏節얍뜝泳졿옅譯볩슛鴉싷슭秧좂텭嚥뜻뱮 111010001011001010000010111001001011110010011010111011111011110110111110111001011000110110100000111001111011011010100000111001101011111110110110111001101011101110010011111011111011110110111000111001001011110010011010111011111011110110111110111001001110101110100000111001111011011010100000111001101011111110110110111001101001001110010100 e8b282e4bc9aefbdbee58da0e7b6a0e6bfb6e6bb93efbdb8e4bc9aefbdbee4eba0e7b6a0e6bfb6e69394

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
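
A conversion like the one in the table can be approximated with Python's standard codecs. This is a minimal sketch, not the page's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and convert are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which resembles the runs of 3f seen in the table.

```python
# Charset labels follow the table; the Python codec names are assumptions.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.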