To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???A}v???A}vB 00111111001111110011111101000001011111010111011000111111001111110011111101000001011111010111011001000010 3f3f3f417d763f3f3f417d7642
SJIS-WIN ?審こA}v?審こA}vB 0011111110010000010100101000001010110001010000010111110101110110001111111001000001010010100000101011000101000001011111010111011001000010 3f905282b1417d763f905282b1417d7642
EUC-JP ?審こA}v?審こA}vB 0011111110111111101100111010010010110011010000010111110101110110001111111011111110110011101001001011001101000001011111010111011001000010 3fbfb3a4b3417d763fbfb3a4b3417d7642
UTF-8 얇審こA}v얇審こA}vB 11101100100101101000011111100101101011111010100111100011100000011001001101000001011111010111011011101100100101101000011111100101101011111010100111100011100000011001001101000001011111010111011001000010 ec9687e5afa9e38193417d76ec9687e5afa9e38193417d7642
UHC 얇審こA}v얇審こA}vB 10111110111000111110001111111011101010101011001101000001011111010111011010111110111000111110001111111011101010101011001101000001011111010111011001000010 bee3e3fbaab3417d76bee3e3fbaab3417d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)