To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????W}????W{^ | 00111111001111110011111100111111010101110111110100111111001111110011111100111111010101110111101101011110 | 3f3f3f3f577d3f3f3f3f577b5e |
| SJIS-WIN | ????W}????W{^ | 00111111001111110011111100111111010101110111110100111111001111110011111100111111010101110111101101011110 | 3f3f3f3f577d3f3f3f3f577b5e |
| EUC-JP | ????W}????W{^ | 00111111001111110011111100111111010101110111110100111111001111110011111100111111010101110111101101011110 | 3f3f3f3f577d3f3f3f3f577b5e |
| UTF-8 | 횚횁횒짠W}횚횁횒짠W{^ | 1110110110011010100110101110110110011010100000011110110110011010100100101110110010100111101000000101011101111101111011011001101010011010111011011001101010000001111011011001101010010010111011001010011110100000010101110111101101011110 | ed9a9aed9a81ed9a92eca7a0577ded9a9aed9a81ed9a92eca7a0577b5e |
| UHC | 횚횁횒짠W}횚횁횒짠W{^ | 110000111001010011000011100000011100001110001101110000101010011101010111011111011100001110010100110000111000000111000011100011011100001010100111010101110111101101011110 | c394c381c38dc2a7577dc394c381c38dc2a7577b5e |

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
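
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` is hypothetical, and it assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to SJIS-WIN, EUC-JP, and UHC here. Characters a charset cannot represent are replaced with "?" (0x3f), which is why the Hangul characters appear as "????" in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return a (charset, binary string, hex string) tuple for each charset.

    The charset names are Python codec names; mapping them to the labels
    used on this page (SJIS-WIN, EUC-JP, UHC) is an assumption.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters
        data = text.encode(name, errors="replace")
        # Each byte becomes exactly 8 binary digits, zero-padded on the left
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


for name, bits, hex_string in encode_table("짠W}"):
    print(name, bits, hex_string)
```

The binary column is just the hexadecimal column written out bit by bit, so the two always describe the same bytes.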