To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 悌???氈???止 100100101110111000111111001111110011111110011111100000010011111100111111001111111000111001111110 92ee3f3f3f9f813f3f3f8e7e
EUC-JP 悌???氈???止 110001001111000000111111001111110011111111011101111000010011111100111111001111111011101111011111 c4f03f3f3fdde13f3f3fbbdf
UTF-8 悌쿰렰렎氈폈렰렩止 111001101000001010001100111011001011111110110000111010111010000010110000111010111010000010001110111001101011000010001000111011011000111110001000111010111010000010110000111010111010000010101001111001101010110110100010 e6828cecbfb0eba0b0eba08ee6b088ed8f88eba0b0eba0a9e6ada2
UHC 悌쿰렰렎氈폈렰렩止 111100001010101011000100111100011000111010111101100011101010010011101110111111011100011011110001100011101011110110001110101101111111001010101101 f0aac4f18ebd8ea4eefdc6f18ebd8eb7f2ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)