What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??????陰??碎??鶯??????陰 1110100111110010001111110011111100111111001111110011111100111111100010010100000100111111001111111110000111101010001111110011111111101001111100100011111100111111001111110011111100111111001111111000100101000001 e9f23f3f3f3f3f3f89413f3fe1ea3f3fe9f23f3f3f3f3f3f8941
EUC-JP 鶯???獒??陰??碎??鶯???獒??陰 111100101111010000111111001111110011111110001111110010111011101100111111001111111011000110100010001111110011111111100010111011000011111100111111111100101111010000111111001111110011111110001111110010111011101100111111001111111011000110100010 f2f43f3f3f8fcbbb3f3fb1a23f3fe2ec3f3ff2f43f3f3f8fcbbb3f3fb1a2
UTF-8 鶯ㅺ퉮횞獒뺣뛾陰덅퐛碎룻렧鶯ㅺ퉮횞獒뺣뛾陰 111010011011011010101111111000111000010110111010111011011000100110101110111011011001101010011110111001111000110110010010111010111011101010100011111010111001101110111110111010011001100110110000111010111000110110000101111011011001000010011011111001111010001010001110111010111010001110111011111010111010000010100111111010011011011010101111111000111000010110111010111011011000100110101110111011011001101010011110111001111000110110010010111010111011101010100011111010111001101110111110111010011001100110110000 e9b6afe385baed89aeed9a9ee78d92ebbaa3eb9bbee999b0eb8d85ed909be7a28eeba3bbeba0a7e9b6afe385baed89aeed9a9ee78d92ebbaa3eb9bbee999b0
UHC 鶯ㅺ퉮횞獒뺣뛾陰덅퐛碎룻렧鶯ㅺ퉮횞獒뺣뛾陰 111001011010001110100100111010101011100110000110110000111001011111101000101000111001010111101011100011011000010011101011111001001000100011101000101111011000010111100001111011111011011111101101100011101011011011100101101000111010010011101010101110011000011011000011100101111110100010100011100101011110101110001101100001001110101111100100 e5a3a4eab986c397e8a395eb8d84ebe488e8bd85e1efb7ed8eb6e5a3a4eab986c397e8a395eb8d84ebe4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
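
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become '?' (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Build one result row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in convert("陰").items():
        print(label, shown, bits, hex_string)
```

For the single character 陰, this sketch yields the hexadecimal strings 8941 (SJIS-WIN), b1a2 (EUC-JP), and e999b0 (UTF-8), which are the byte sequences that appear for that character in the corresponding rows of the table.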