What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???v???vB | 001111110011111100111111011101100011111100111111001111110111011001000010 | 3f3f3f763f3f3f7642
SJIS-WIN | ?醒?v?醒?vB | 0011111110010000110000010011111101110110001111111001000011000001001111110111011001000010 | 3f90c13f763f90c13f7642
EUC-JP | ?醒?v?醒?vB | 0011111111000000110000110011111101110110001111111100000011000011001111110111011001000010 | 3fc0c33f763fc0c33f7642
UTF-8 | 짊醒둥v짊醒둥vB | 111011001010011110001010111010011000011010010010111010111001000110100101011101101110110010100111100010101110100110000110100100101110101110010001101001010111011001000010 | eca78ae98692eb91a576eca78ae98692eb91a57642
UHC | 짊醒둥v짊醒둥vB | 110000011111101111100001101001011011010111010101011101101100000111111011111000011010010110110101110101010111011001000010 | c1fbe1a5b5d576c1fbe1a5b5d57642

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
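
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the names CODECS, encode_bits, and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("짊醒둥vB").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.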