To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 彫?郵?彫?渦?N}彫?郵?彫?渦?N{^ 1001001010100100001111111001011101011000001111111001001010100100001111111000100101010001001111110100111001111101100100101010010000111111100101110101100000111111100100101010010000111111100010010101000100111111010011100111101101011110 92a43f97583f92a43f89513f4e7d92a43f97583f92a43f89513f4e7b5e
EUC-JP 彫?郵?彫?渦?N}彫?郵?彫?渦?N{^ 1100010010100110001111111100110110111001001111111100010010100110001111111011000110110010001111110100111001111101110001001010011000111111110011011011100100111111110001001010011000111111101100011011001000111111010011100111101101011110 c4a63fcdb93fc4a63fb1b23f4e7dc4a63fcdb93fc4a63fb1b23f4e7b5e
UTF-8 彫렣郵렭彫렣渦렭N}彫렣郵렭彫렣渦렭N{^ 1110010110111101101010111110101110100000101000111110100110000011101101011110101110100000101011011110010110111101101010111110101110100000101000111110011010111000101001101110101110100000101011010100111001111101111001011011110110101011111010111010000010100011111010011000001110110101111010111010000010101101111001011011110110101011111010111010000010100011111001101011100010100110111010111010000010101101010011100111101101011110 e5bdabeba0a3e983b5eba0ade5bdabeba0a3e6b8a6eba0ad4e7de5bdabeba0a3e983b5eba0ade5bdabeba0a3e6b8a6eba0ad4e7b5e
UHC 彫렣郵렭彫렣渦렭N}彫렣郵렭彫렣渦렭N{^ 11110000110000011000111010110100111010011110100010001110101110101111000011000001100011101011010011101000101111101000111010111010010011100111110111110000110000011000111010110100111010011110100010001110101110101111000011000001100011101011010011101000101111101000111010111010010011100111101101011110 f0c18eb4e9e88ebaf0c18eb4e8be8eba4e7df0c18eb4e9e88ebaf0c18eb4e8be8eba4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)