What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 陋ケ謚題峪鬢蛙 11101000100110111011100111100110100010101001000111101000100110111011100111101001101001001000101001011110 e89bb9e68a91e89bb9e9a48a5e
EUC-JP 陋ケ謚題峪鬢蛙 1110111111111011100011101011100111101011111010101100001011101010110101101011101111110010101001101011001110111111 effb8eb9ebeac2ead6bbf2a6b3bf
UTF-8 陋ケ謚題峪鬢蛙 111010011001100110001011111011111011110110111001111010001010110010011010111010011010000110001100111001011011001110101010111010011010110010100010111010001001101110011001 e9998befbdb9e8ac9ae9a18ce5b3aae9aca2e89b99
UHC 陋?謚題??蛙 1101011110110000001111111110110011010000111100001011100100111111001111111110100011000011 d7b03fecd0f0b93f3fe8c3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).