To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 堊??愿?、乳 1001101010111111001111110011111110011100110000110011111110000001010000011001001111111011 9abf3f3f9cc33f814193fb
EUC-JP 堊??愿?、乳 1101010011000001001111110011111111011000110001010011111110100001101000101100011011111101 d4c13f3fd8c53fa1a2c6fd
UTF-8 堊앾퐡愿⒴、乳 111001011010000010001010111011001001010110111110111011011001000010100001111001101000010010111111111000101001001010110100111000111000000010000001111001001011100110110011 e5a08aec95beed90a1e684bfe292b4e38081e4b9b3
UHC 堊앾퐡愿⒴、乳 1110010010111110100111011110111110111101100010101110101010110100101010011110010110100001101000101110101011100001 e4be9defbd8aeab4a9e5a1a2eae1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)