To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 霎」謇カ霎」蠎弯 11101000101111101010001111100110100010011011011011101000101111101010001111100101101110101001110001011110 e8bea3e689b6e8bea3e5ba9c5e
EUC-JP 霎」謇カ霎」蠎弯 11110000110000001000111010100011111010111110100110001110101101101111000011000000100011101010001111101010101111001101011110111111 f0c08ea3ebe98eb6f0c08ea3eabcd7bf
UTF-8 霎」謇カ霎」蠎弯 111010011001110010001110111011111011110110100011111010001010110010000111111011111011110110110110111010011001110010001110111011111011110110100011111010001010000010001110111001011011110010101111 e99c8eefbda3e8ac87efbdb6e99c8eefbda3e8a08ee5bcaf
UHC ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)