What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | ?る?遊??諛 | 00111111100000101110100100111111100101110101011000111111001111111110011010000111 | 3f82e93f97563f3fe687 |
| EUC-JP | ?る?遊??諛 | 00111111101001001110101100111111110011011011011100111111001111111110101111100111 | 3fa4eb3fcdb73f3febe7 |
| UTF-8 | 閭る베遊삡슭諛 | 111011111010011010000110111000111000001010001011111010111011001010100000111010011000000110001010111011001000001010100001111011001000101010101101111010001010101110011011 | efa686e3828bebb2a0e9818aec82a1ec8aade8ab9b |
| UHC | 閭る베遊삡슭諛 | 1110011010101101101010101110101110111010101000111110101110110100101110111110010010111101101111101110101110110000 | e6adaaebbaa3ebb4bbe4bdbeebb0 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
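
The conversion shown in the table can be approximated in a few lines of Python. This is a minimal sketch, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the Python codec names `cp932` and `cp949` are assumed stand-ins for SJIS-Win and UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 row above appears to show.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(charset, errors="replace")
    # Render every byte as exactly 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed Python codec names for the charsets listed in the table.
CHARSETS = ["iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949"]

for charset in CHARSETS:
    binary, hexadecimal = to_bit_strings("る", charset)
    print(charset, binary, hexadecimal)
```

For the single character る, this gives `82e9` for `cp932`, `a4eb` for `euc_jp`, and `e3828b` for `utf-8`, the same byte values that appear for that character in the rows above. Codec tables can differ between implementations for some characters, so results for other inputs may not match this page exactly.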