Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????B | 001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f42
SJIS-WIN | ??氓???氓?B | 0011111100111111100111111000001000111111001111110011111110011111100000100011111101000010 | 3f3f9f823f3f3f9f823f42
EUC-JP | ??氓???氓?B | 0011111100111111110111011110001000111111001111110011111111011101111000100011111101000010 | 3f3fdde23f3f3fdde23f42
UTF-8 | 뤚춲氓햕뤚춲氓햕B | 11101011101001001001101011101100101101101011001011100110101100001001001111101101100101101001010111101011101001001001101011101100101101101011001011100110101100001001001111101101100101101001010101000010 | eba49aecb6b2e6b093ed9695eba49aecb6b2e6b093ed969542
UHC | 뤚춲氓햕뤚춲氓햕B | 1000111111001001101011011000111011011000111011001100000101101001100011111100100110101101100011101101100011101100110000010110100101000010 | 8fc9ad8ed8ecc1698fc9ad8ed8ecc16942

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
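
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's codec names `cp932` and `cp949` correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the table. The helper name `to_bit_strings` is hypothetical.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# The cp932 and cp949 choices are assumptions, not confirmed by the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the single character "B", every charset listed here yields the same one-byte result, `01000010` and `42`, which matches the final byte of each row in the table.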