To what bit string is a character (or a short string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 暗??泣??純??B 10001000110000110011111100111111100010111000001100111111001111111000111110000011001111110011111101000010 88c33f3f8b833f3f8f833f3f42
EUC-JP 暗??泣??純??B 10110000110001010011111100111111101101011110001100111111001111111011110111100011001111110011111101000010 b0c53f3fb5e33f3fbde33f3f42
UTF-8 暗삳툦泣섉쾬純놁뒛B 11100110100110101001011111101100100000101011001111101101100010001010011011100110101100111010001111101100100001001000100111101100101111101010110011100111101101001001010011101011100001101000000111101011100100101001101101000010 e69a97ec82b3ed88a6e6b3a3ec8489ecbeace7b494eb8681eb929b42
UHC 暗삳툦泣섉쾬純놁뒛B 11100100110111101011101111101011101110001001110111101011111010001001100011100110101100101000001111100010111011011000011011101100100010101001100001000010 e4debbebb89debe898e6b283e2ed86ec8a9842

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
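
The rows in the table above can be approximated with a short Python sketch. This is not the converter's own implementation, which is not shown on this page. The mapping from the table's charset labels to Python codec names (in particular "SJIS-WIN" to cp932 and "UHC" to cp949) and the helper name encode_row are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the "?" entries in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: characters the codec cannot represent
    are replaced with '?' (byte 0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one line per charset: label, binary string, hex string.
    for label, codec in CODECS.items():
        bits, hex_string = encode_row("暗B", codec)
        print(label, bits, hex_string)
```

For example, encode_row("暗", "utf-8") yields the hex string e69a97, the first three bytes of the UTF-8 row, and encode_row("暗", "euc_jp") yields b0c5, the first two bytes of the EUC-JP row.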