To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | ?里權愕?務? | 0011111110010111101000101001111011011100100111001011000100111111100101101011000100111111 | 3f97a29edc9cb13f96b13f |
| EUC-JP | ?里權愕?務? | 0011111111001110101001001101110011011110110110001011001100111111110011001011001100111111 | 3fcea4dcded8b33fccb33f |
| UTF-8 | 뤵里權愕뤔務흳 | 111010111010010010110101111010011000011110001100111001101010110010001010111001101000010010010101111010111010010010010100111001011000101110011001111011011001110110110011 | eba4b5e9878ce6ac8ae68495eba494e58b99ed9db3 |
| UHC | 뤵里權愕뤔務흳 | 1000111111100011110101111110110011001111111011011110010011000011100011111100010011011001111000101100010110001110 | 8fe3d7eccfede4c38fc4d9e2c58e |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
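
A conversion like the one in the table can be approximated with Python's built-in codecs. This is only a minimal sketch, not the page's own implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is what the "?" entries in the table suggest.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits and concatenate them.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary string, hex string.
    for label, (bits, hexstr) in encode_table("里").items():
        print(label, bits, hexstr)
```

Using `errors="replace"` keeps the output the same length in characters as the input, so an unsupported character shows up as a single 0x3f byte instead of raising an exception.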