To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???苡ュ?節???ш????攸??搖??? 00111111001111110011111111100100100011111000001110000101001111111001000011011111001111110011111100111111100001001000101000111111001111110011111100111111100111011011111100111111001111111001110110001010001111110011111100111111 3f3f3fe48f83853f90df3f3f3f848a3f3f3f3f9dbf3f3f9d8a3f3f3f
EUC-JP ???苡ュ?節???ш????攸??搖??嫄 001111110011111100111111111001111110111110100101111001010011111111000000111000010011111100111111001111111010011111101010001111110011111100111111001111111101101011000001001111110011111111011001111010100011111100111111100011111011101010100001 3f3f3fe7efa5e53fc0e13f3f3fa7ea3f3f3f3fdac13f3fd9ea3f3f8fbaa1
UTF-8 列룸벊苡ュ룛節뗰폊嶪ш퉮杻ⓨ푻攸낉펻搖삳뎻嫄 1110111110100110100111001110101110100011101110001110101110110010100010101110100010001011101000011110001110000011101001011110101110100011100110111110011110101111100000001110101110010111101100001110110110001111100010101110010110110110101010101101000110001000111011011000100110101110111011111010011110001000111000101001001110101000111011011001000110111011111001101001010010111000111010111000001010001001111011011000111010111011111001101001000010010110111011001000001010110011111010111000111010111011111001011010101110000100 efa69ceba3b8ebb28ae88ba1e383a5eba39be7af80eb97b0ed8f8ae5b6aad188ed89aeefa788e293a8ed91bbe694b8eb8289ed8ebbe69096ec82b3eb8ebbe5ab84
UHC 列룸벊苡ュ룛節뗰폊嶪ш퉮杻ⓨ푻攸낉펻搖삳뎻嫄 1110011011101010101101111110101110010011101011011110110010111110101010111110010110001111100101111110111110111101100010111110111110111100100101011110010111110101101011001110101010111001100001101110101011110100101010001110010110111110100001111110101011110010100001011110111110111100100010111110100011110100101110111110101110001001100011101110101010110001 e6eab7eb93adecbeabe58f97efbd8befbc95e5f5aceab986eaf4a8e5be87eaf285efbc8be8f4bbeb898eeab1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)