To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????d}????????d{^ 001111110011111100111111001111110011111100111111001111110011111101100100011111010011111100111111001111110011111100111111001111110011111100111111011001000111101101011110 3f3f3f3f3f3f3f3f647d3f3f3f3f3f3f3f3f647b5e
SJIS-WIN 逵ク訷ァ逵ク閠。d}逵ク訷ァ逵ク閠。d{^ 1110011110011100101110001111101110100100101001111110011110011100101110001110100010000000101000010110010001111101111001111001110010111000111110111010010010100111111001111001110010111000111010001000000010100001011001000111101101011110 e79cb8fba4a7e79cb8e880a1647de79cb8fba4a7e79cb8e880a1647b5e
EUC-JP 逵ク訷ァ逵ク閠。d}逵ク訷ァ逵ク閠。d{^ 111011011111110010001110101110001000111111011101110101001000111010100111111011011111110010001110101110001110111111100000100011101010000101100100011111011110110111111100100011101011100010001111110111011101010010001110101001111110110111111100100011101011100011101111111000001000111010100001011001000111101101011110 edfc8eb88fddd48ea7edfc8eb8efe08ea1647dedfc8eb88fddd48ea7edfc8eb8efe08ea1647b5e
UTF-8 逵ク訷ァ逵ク閠。d}逵ク訷ァ逵ク閠。d{^ 1110100110000000101101011110111110111101101110001110100010101000101101111110111110111101101001111110100110000000101101011110111110111101101110001110100110010110101000001110111110111101101000010110010001111101111010011000000010110101111011111011110110111000111010001010100010110111111011111011110110100111111010011000000010110101111011111011110110111000111010011001011010100000111011111011110110100001011001000111101101011110 e980b5efbdb8e8a8b7efbda7e980b5efbdb8e996a0efbda1647de980b5efbdb8e8a8b7efbda7e980b5efbdb8e996a0efbda1647b5e
UHC 逵???逵???d}逵???逵???d{^ 11010000101100000011111100111111001111111101000010110000001111110011111100111111011001000111110111010000101100000011111100111111001111111101000010110000001111110011111100111111011001000111101101011110 d0b03f3f3fd0b03f3f3f647dd0b03f3f3fd0b03f3f3f647b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)