To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 陝抵シ宿陝抵シ叔N}陝抵シ宿陝抵シ叔N{^ 111010001001111110010010111011111011110010001111011010001110100010011111100100101110111110111100100011110110011001001110011111011110100010011111100100101110111110111100100011110110100011101000100111111001001011101111101111001000111101100110010011100111101101011110 e89f92efbc8f68e89f92efbc8f664e7de89f92efbc8f68e89f92efbc8f664e7b5e
EUC-JP 陝抵シ宿陝抵シ叔N}陝抵シ宿陝抵シ叔N{^ 11110000101000011100010011110001100011101011110010111101110010011111000010100001110001001111000110001110101111001011110111000111010011100111110111110000101000011100010011110001100011101011110010111101110010011111000010100001110001001111000110001110101111001011110111000111010011100111101101011110 f0a1c4f18ebcbdc9f0a1c4f18ebcbdc74e7df0a1c4f18ebcbdc9f0a1c4f18ebcbdc74e7b5e
UTF-8 陝抵シ宿陝抵シ叔N}陝抵シ宿陝抵シ叔N{^ 1110100110011001100111011110011010001010101101011110111110111101101111001110010110101110101111111110100110011001100111011110011010001010101101011110111110111101101111001110010110001111100101000100111001111101111010011001100110011101111001101000101010110101111011111011110110111100111001011010111010111111111010011001100110011101111001101000101010110101111011111011110110111100111001011000111110010100010011100111101101011110 e9999de68ab5efbdbce5aebfe9999de68ab5efbdbce58f944e7de9999de68ab5efbdbce5aebfe9999de68ab5efbdbce58f944e7b5e
UHC 陝抵?宿陝抵?叔N}陝抵?宿陝抵?叔N{^ 111000001110110111101110101111010011111111100010110101101110000011101101111011101011110100111111111000101101001001001110011111011110000011101101111011101011110100111111111000101101011011100000111011011110111010111101001111111110001011010010010011100111101101011110 e0edeebd3fe2d6e0edeebd3fe2d24e7de0edeebd3fe2d6e0edeebd3fe2d24e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
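
The conversion in the table can be approximated with a short script. This is a minimal sketch, not the tool's actual implementation: the function name `encode_table` is hypothetical, and the Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed stand-ins for the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), and the exact byte sequences may differ from the tool's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become b"?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("N{^"):
    print(label, bits, hex_string)
```

For the ASCII-only input `N{^`, every charset yields the same bytes, `4e7b5e`, which matches the trailing bytes of each row in the table.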