To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????h 0011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f68
SJIS-WIN ?學幀存?鈍渡h 00111111100110110111101110011011111010101001000110110110001111111001001111011101100100110110111001101000 3f9b7b9bea91b63f93dd936e68
EUC-JP ?學幀存?鈍渡h 00111111110101011101110011010110111011001100001010111000001111111100011011011111110001011100111101101000 3fd5dcd6ecc2b83fc6dfc5cf68
UTF-8 뤋學幀存샘鈍渡h 11101011101001001000101111100101101011011011100011100101101110011000000011100101101011011001100011101100100000111001100011101001100010001000110111100110101110001010000101101000 eba48be5adb8e5b980e5ad98ec8398e9888de6b8a168
UHC 뤋學幀存샘鈍渡h 100011111011101111111001110010101110111111010011111100001110110110111011111110011101010011101111110101001010010001101000 8fbbf9caefd3f0edbbf9d4efd4a468

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)