To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??泣??邑??? 10001010011110000011111100111111100010111000001100111111001111111001011101010111001111110011111100111111 8a783f3f8b833f3f97573f3f3f
EUC-JP 岳??泣??邑??? 10110011110110010011111100111111101101011110001100111111001111111100110110111000001111110011111100111111 b3d93f3fb5e33f3fcdb83f3f3f
UTF-8 岳됰냲泣앶댖邑㎯꽬劣 111001011011001010110011111010111001000010110000111010111000001110110010111001101011001110100011111011001001010110110110111010111000110010010110111010011000001010010001111000111000111010101111111010101011110110101100111011111010011010011101 e5b2b3eb90b0eb83b2e6b3a3ec95b6eb8c96e98291e38eafeabdacefa69d
UHC 岳됰냲泣앶댖邑㎯꽬劣 1110010010111111100010011110101110000110100000101110101111101000100111011110100110001000101110101110101111101001101001111110001110000100101101111110011011101011 e4bf89eb8682ebe89de988baebe9a7e384b7e6eb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)