To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 腫陌??諍?音?N}腫陌??諍?音?N{^ 1000111011101110111010001001100100111111001111111110011001111001001111111000100110111001001111110100111001111101100011101110111011101000100110010011111100111111111001100111100100111111100010011011100100111111010011100111101101011110 8eeee8993f3fe6793f89b93f4e7d8eeee8993f3fe6793f89b93f4e7b5e
EUC-JP 腫陌??諍?音?N}腫陌??諍?音?N{^ 1011110011110000111011111111100100111111001111111110101111011010001111111011001010111011001111110100111001111101101111001111000011101111111110010011111100111111111010111101101000111111101100101011101100111111010011100111101101011110 bcf0eff93f3febda3fb2bb3f4e7dbcf0eff93f3febda3fb2bb3f4e7b5e
UTF-8 腫陌렭렕諍렋音렜N}腫陌렭렕諍렋音렜N{^ 1110100010000101101010111110100110011001100011001110101110100000101011011110101110100000100101011110100010101011100011011110101110100000100010111110100110011111101100111110101110100000100111000100111001111101111010001000010110101011111010011001100110001100111010111010000010101101111010111010000010010101111010001010101110001101111010111010000010001011111010011001111110110011111010111010000010011100010011100111101101011110 e885abe9998ceba0adeba095e8ab8deba08be99fb3eba09c4e7de885abe9998ceba0adeba095e8ab8deba08be99fb3eba09c4e7b5e
UHC 腫陌렭렕諍렋音렜N}腫陌렭렕諍렋音렜N{^ 11110000111111101101100011101000100011101011101010001110101010101110111010110101100011101010001011101011111001011000111010101110010011100111110111110000111111101101100011101000100011101011101010001110101010101110111010110101100011101010001011101011111001011000111010101110010011100111101101011110 f0fed8e88eba8eaaeeb58ea2ebe58eae4e7df0fed8e88eba8eaaeeb58ea2ebe58eae4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)