To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 銜。鄒嗜R銜。鄒嗜^[銜。鄒嗜R銜。鄒嗜^[^ 1110011111110000101000011110011110111110100110100110111001010010111001111111000010100001111001111011111010011010011011100101111001011011111001111111000010100001111001111011111010011010011011100101001011100111111100001010000111100111101111101001101001101110010111100101101101011110 e7f0a1e7be9a6e52e7f0a1e7be9a6e5e5be7f0a1e7be9a6e52e7f0a1e7be9a6e5e5b5e
EUC-JP 銜。鄒嗜R銜。鄒嗜^[銜。鄒嗜R銜。鄒嗜^[^ 111011101111001010001110101000011110111011000000110100111100111101010010111011101111001010001110101000011110111011000000110100111100111101011110010110111110111011110010100011101010000111101110110000001101001111001111010100101110111011110010100011101010000111101110110000001101001111001111010111100101101101011110 eef28ea1eec0d3cf52eef28ea1eec0d3cf5e5beef28ea1eec0d3cf52eef28ea1eec0d3cf5e5b5e
UTF-8 銜。鄒嗜R銜。鄒嗜^[銜。鄒嗜R銜。鄒嗜^[^ 11101001100010101001110011101111101111011010000111101001100001001001001011100101100101111001110001010010111010011000101010011100111011111011110110100001111010011000010010010010111001011001011110011100010111100101101111101001100010101001110011101111101111011010000111101001100001001001001011100101100101111001110001010010111010011000101010011100111011111011110110100001111010011000010010010010111001011001011110011100010111100101101101011110 e98a9cefbda1e98492e5979c52e98a9cefbda1e98492e5979c5e5be98a9cefbda1e98492e5979c52e98a9cefbda1e98492e5979c5e5b5e
UHC 銜?鄒嗜R銜?鄒嗜^[銜?鄒嗜R銜?鄒嗜^[^ 1111100111100111001111111111010111011011110100001110111001010010111110011110011100111111111101011101101111010000111011100101111001011011111110011110011100111111111101011101101111010000111011100101001011111001111001110011111111110101110110111101000011101110010111100101101101011110 f9e73ff5dbd0ee52f9e73ff5dbd0ee5e5bf9e73ff5dbd0ee52f9e73ff5dbd0ee5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)