Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 逵ク訷ァ逵ク迚セW}逵ク訷ァ逵ク迚セW{^ 1110011110011100101110001111101110100100101001111110011110011100101110001110011110001001101111100101011101111101111001111001110010111000111110111010010010100111111001111001110010111000111001111000100110111110010101110111101101011110 e79cb8fba4a7e79cb8e789be577de79cb8fba4a7e79cb8e789be577b5e
EUC-JP 逵ク訷ァ逵ク迚セW}逵ク訷ァ逵ク迚セW{^ 111011011111110010001110101110001000111111011101110101001000111010100111111011011111110010001110101110001110110111101001100011101011111001010111011111011110110111111100100011101011100010001111110111011101010010001110101001111110110111111100100011101011100011101101111010011000111010111110010101110111101101011110 edfc8eb88fddd48ea7edfc8eb8ede98ebe577dedfc8eb88fddd48ea7edfc8eb8ede98ebe577b5e
UTF-8 逵ク訷ァ逵ク迚セW}逵ク訷ァ逵ク迚セW{^ 1110100110000000101101011110111110111101101110001110100010101000101101111110111110111101101001111110100110000000101101011110111110111101101110001110100010111111100110101110111110111101101111100101011101111101111010011000000010110101111011111011110110111000111010001010100010110111111011111011110110100111111010011000000010110101111011111011110110111000111010001011111110011010111011111011110110111110010101110111101101011110 e980b5efbdb8e8a8b7efbda7e980b5efbdb8e8bf9aefbdbe577de980b5efbdb8e8a8b7efbda7e980b5efbdb8e8bf9aefbdbe577b5e
UHC 逵???逵???W}逵???逵???W{^ 11010000101100000011111100111111001111111101000010110000001111110011111100111111010101110111110111010000101100000011111100111111001111111101000010110000001111110011111100111111010101110111101101011110 d0b03f3f3fd0b03f3f3f577dd0b03f3f3fd0b03f3f3f577b5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
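
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's own and are assumed to correspond to the charsets listed above; the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is assumed to match the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("W{^").items():
        print(label, binary, hexadecimal)
```

Because `W`, `{` and `^` are ASCII characters, all five charsets encode `W{^` as the same three bytes, `577b5e`, which is the tail of every hexadecimal string in the table. Results for non-ASCII input may differ slightly from the table where Python's codecs and the page's converter disagree on vendor-specific mappings.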