To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 齬?????訝??N}齬?????訝??N{^ 111010101001011100111111001111110011111100111111001111111110011001100010001111110011111101001110011111011110101010010111001111110011111100111111001111110011111111100110011000100011111100111111010011100111101101011110 ea973f3f3f3f3fe6623f3f4e7dea973f3f3f3f3fe6623f3f4e7b5e
EUC-JP 齬?????訝??N}齬?????訝??N{^ 111100111111011100111111001111110011111100111111001111111110101111000011001111110011111101001110011111011111001111110111001111110011111100111111001111110011111111101011110000110011111100111111010011100111101101011110 f3f73f3f3f3f3febc33f3f4e7df3f73f3f3f3f3febc33f3f4e7b5e
UTF-8 齬싦말女싨깪訝삯본N}齬싦말女싨깪訝삯본N{^ 1110100110111101101011001110110010001011101001101110101110100111100100001110111110100110100000011110110010001011101010001110101010111001101010101110100010101000100111011110110010000010101011111110101110110011101110000100111001111101111010011011110110101100111011001000101110100110111010111010011110010000111011111010011010000001111011001000101110101000111010101011100110101010111010001010100010011101111011001000001010101111111010111011001110111000010011100111101101011110 e9bdacec8ba6eba790efa681ec8ba8eab9aae8a89dec82afebb3b84e7de9bdacec8ba6eba790efa681ec8ba8eab9aae8a89dec82afebb3b84e7b5e
UHC 齬싦말女싨깪訝삯본N}齬싦말女싨깪訝삯본N{^ 1110010111100001100110101110010010111000101110111110010111111100100110101110011010000011100110101110010010111000101110111110100110111010101110110100111001111101111001011110000110011010111001001011100010111011111001011111110010011010111001101000001110011010111001001011100010111011111010011011101010111011010011100111101101011110 e5e19ae4b8bbe5fc9ae6839ae4b8bbe9babb4e7de5e19ae4b8bbe5fc9ae6839ae4b8bbe9babb4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)