What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 災???R災???^[災???R災???^[^ 100011011101000000111111001111110011111101010010100011011101000000111111001111110011111101011110010110111000110111010000001111110011111100111111010100101000110111010000001111110011111100111111010111100101101101011110 8dd03f3f3f528dd03f3f3f5e5b8dd03f3f3f528dd03f3f3f5e5b5e
EUC-JP 災???R災???^[災???R災???^[^ 101110101101001000111111001111110011111101010010101110101101001000111111001111110011111101011110010110111011101011010010001111110011111100111111010100101011101011010010001111110011111100111111010111100101101101011110 bad23f3f3f52bad23f3f3f5e5bbad23f3f3f52bad23f3f3f5e5b5e
UTF-8 災곡렔렪R災곡렔렪^[災곡렔렪R災곡렔렪^[^ 11100111100000011011110111101010101100111010000111101011101000001001010011101011101000001010101001010010111001111000000110111101111010101011001110100001111010111010000010010100111010111010000010101010010111100101101111100111100000011011110111101010101100111010000111101011101000001001010011101011101000001010101001010010111001111000000110111101111010101011001110100001111010111010000010010100111010111010000010101010010111100101101101011110 e781bdeab3a1eba094eba0aa52e781bdeab3a1eba094eba0aa5e5be781bdeab3a1eba094eba0aa52e781bdeab3a1eba094eba0aa5e5b5e
UHC 災곡렔렪R災곡렔렪^[災곡렔렪R災곡렔렪^[^ 111011101010110010110000111011101000111010101001100011101011100001010010111011101010110010110000111011101000111010101001100011101011100001011110010110111110111010101100101100001110111010001110101010011000111010111000010100101110111010101100101100001110111010001110101010011000111010111000010111100101101101011110 eeacb0ee8ea98eb852eeacb0ee8ea98eb85e5beeacb0ee8ea98eb852eeacb0ee8ea98eb85e5b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
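
The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the function name encode_to_bitstrings and the CODECS mapping are hypothetical, and the choice of Python codecs (cp932 for SJIS-WIN, cp949 for UHC) is an assumption that may differ from the tool in edge-case mappings. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Map the charset labels used in the table to Python codec names.
# These codec choices are assumptions, not the tool's own configuration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bitstrings(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for each unencodable character.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_to_bitstrings("災R", codec)
        print(label, binary, hexadecimal)
```

For a single character such as 災, this sketch yields e781bd in UTF-8, 8dd0 in CP932, and bad2 in EUC-JP, in line with the corresponding bytes in the table.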