Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | ???宥??魏??辱 | 00111111001111110011111110010111010001110011111100111111111010011011000000111111001111111001000001001010 | 3f3f3f97473f3fe9b03f3f904a |
| EUC-JP | ???宥??魏??辱 | 00111111001111110011111111001101101010000011111100111111111100101011001000111111001111111011111110101011 | 3f3f3fcda83f3ff2b23f3fbfab |
| UTF-8 | 娛뤴뫁宥딀콨魏놁뵏辱 | 111001011010100010011011111010111010010010110100111010111010101110000001111001011010111010100101111010111001010010000000111011001011110110101000111010011010110110001111111010111000011010000001111010111011010110001111111010001011111010110001 | e5a89beba4b4ebab81e5aea5eb9480ecbda8e9ad8feb8681ebb58fe8beb1 |
| UHC | 娛뤴뫁宥딀콨魏놁뵏辱 | 1110011111110100100011111110001010010001101001011110101011101001100010101110011010110001100111011110101011100000100001101110110010010100100100101110100110110100 | e7f48fe291a5eae98ae6b19deae086ec9492e9b4 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
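
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The helper names `encode_row` and `encode_table`, and the mapping of the table's labels onto Python's standard codecs (`cp932` for SJIS-WIN, `cp949` for UHC), are assumptions made for illustration. A character that a charset cannot represent is replaced by "?" (0x3f), as in the table, although Python's codec tables may differ from this tool's for individual characters.

```python
# Assumed mapping from the table's charset labels to Python's standard codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (character as stored, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising UnicodeEncodeError.
    data = text.encode(codec, errors="replace")
    # Decode again to show what the charset actually stored.
    shown = data.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}
```

Calling `encode_table("あ")` returns one `(character, binary, hexadecimal)` tuple per charset. For example, the UTF-8 entry has the hexadecimal string `e38182`, and the ISO-8859-1 entry has `3f` because the character cannot be represented there.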