To which bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????m}????????m{^ 001111110011111100111111001111110011111100111111001111110011111101101101011111010011111100111111001111110011111100111111001111110011111100111111011011010111101101011110 3f3f3f3f3f3f3f3f6d7d3f3f3f3f3f3f3f3f6d7b5e
SJIS-WIN 逵ク訷ァ逵ク迚セm}逵ク訷ァ逵ク迚セm{^ 1110011110011100101110001111101110100100101001111110011110011100101110001110011110001001101111100110110101111101111001111001110010111000111110111010010010100111111001111001110010111000111001111000100110111110011011010111101101011110 e79cb8fba4a7e79cb8e789be6d7de79cb8fba4a7e79cb8e789be6d7b5e
EUC-JP 逵ク訷ァ逵ク迚セm}逵ク訷ァ逵ク迚セm{^ 111011011111110010001110101110001000111111011101110101001000111010100111111011011111110010001110101110001110110111101001100011101011111001101101011111011110110111111100100011101011100010001111110111011101010010001110101001111110110111111100100011101011100011101101111010011000111010111110011011010111101101011110 edfc8eb88fddd48ea7edfc8eb8ede98ebe6d7dedfc8eb88fddd48ea7edfc8eb8ede98ebe6d7b5e
UTF-8 逵ク訷ァ逵ク迚セm}逵ク訷ァ逵ク迚セm{^ 1110100110000000101101011110111110111101101110001110100010101000101101111110111110111101101001111110100110000000101101011110111110111101101110001110100010111111100110101110111110111101101111100110110101111101111010011000000010110101111011111011110110111000111010001010100010110111111011111011110110100111111010011000000010110101111011111011110110111000111010001011111110011010111011111011110110111110011011010111101101011110 e980b5efbdb8e8a8b7efbda7e980b5efbdb8e8bf9aefbdbe6d7de980b5efbdb8e8a8b7efbda7e980b5efbdb8e8bf9aefbdbe6d7b5e
UHC 逵???逵???m}逵???逵???m{^ 11010000101100000011111100111111001111111101000010110000001111110011111100111111011011010111110111010000101100000011111100111111001111111101000010110000001111110011111100111111011011010111101101011110 d0b03f3f3fd0b03f3f3f6d7dd0b03f3f3fd0b03f3f3f6d7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
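
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names is an assumption (only SJIS-Win = CP932 is stated above; `euc_jp` and `cp949` are the closest standard-library codecs for EUC-JP and UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` bytes in the table, though the replacement may not happen at exactly the same positions.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("m}").items():
        print(label, bits, hexstr)
```

For ASCII-only input such as `m}`, every charset in this sketch yields the same bytes (`6d7d`), which is consistent with the `6d7d` runs visible in each row of the table.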