To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 猷??循??猷??松?6???嗽?6率??^ 100101110101000100111111001111111000111101111010001111110011111110010111010100010011111100111111100011111011110000111111100000100101010100111111001111110011111110011010011101010011111110000010010101011001011110100110001111110011111101011110 97513f3f8f7a3f3f97513f3f8fbc3f82553f3f3f9a753f825597a63f3f5e
EUC-JP 猷??循??猷??松?6???嗽?6率??^ 110011011011001000111111001111111011110111011011001111110011111111001101101100100011111100111111101111101011111000111111101000111011011000111111001111110011111111010011110101100011111110100011101101101100111010101000001111110011111101011110 cdb23f3fbddb3f3fcdb23f3fbebe3fa3b63f3f3fd3d63fa3b6cea83f3f5e
UTF-8 猷띠툞循ㅲ뾿猷뜯뀦松쇰6痢뺛궎嗽뉖6率앪댘^ 11100111100011001011011111101011100111011010000011101101100010001001111011100101101111101010101011100011100001011011001011101011101111101011111111100111100011001011011111101011100111001010111111101011100000001010011011100110100111011011111011101100100001111011000011101111101111001001011011101111101001111010010111101011101110101001101111101010101101101000111011100101100101111011110111101011100010011001011011101111101111001001011011100111100011101000011111101100100101011010101011101011100011001001100001011110 e78cb7eb9da0ed889ee5beaae385b2ebbebfe78cb7eb9cafeb80a6e69dbeec87b0efbc96efa7a5ebba9beab68ee597bdeb8996efbc96e78e87ec95aaeb8c985e
UHC 猷띠툞循ㅲ뾿猷뜯뀦松쇰6痢뺛궎嗽뉖6率앪댘^ 11101011101000111011011011101100101110001001010111100010111000001010010011100010100101111000011111101011101000111011011011100010100001011001110111100001111001101011110011101011101000111011011011101100101110001001010111100011100000101010010011100001111101011000011111101011101000111011011011100001111000111001110111100010100010001011110001011110 eba3b6ecb895e2e0a4e29787eba3b6e2859de1e6bceba3b6ecb895e382a4e1f587eba3b6e1e39de288bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)