To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 遮舎酌縞遮舎蔀赦[遮舎酌縞遮舎蔀赦[^ 1000111011010101100011101100100110001110110111101000111011001000100011101101010110001110110010011000111011000001100011101100110101011011100011101101010110001110110010011000111011011110100011101100100010001110110101011000111011001001100011101100000110001110110011010101101101011110 8ed58ec98ede8ec88ed58ec98ec18ecd5b8ed58ec98ede8ec88ed58ec98ec18ecd5b5e
EUC-JP 遮舎酌縞遮舎蔀赦[遮舎酌縞遮舎蔀赦[^ 1011110011010111101111001100101110111100111000001011110011001010101111001101011110111100110010111011110011000011101111001100111101011011101111001101011110111100110010111011110011100000101111001100101010111100110101111011110011001011101111001100001110111100110011110101101101011110 bcd7bccbbce0bccabcd7bccbbcc3bccf5bbcd7bccbbce0bccabcd7bccbbcc3bccf5b5e
UTF-8 遮舎酌縞遮舎蔀赦[遮舎酌縞遮舎蔀赦[^ 111010011000000110101110111010001000100010001110111010011000010110001100111001111011100010011110111010011000000110101110111010001000100010001110111010001001010010000000111010001011010110100110010110111110100110000001101011101110100010001000100011101110100110000101100011001110011110111000100111101110100110000001101011101110100010001000100011101110100010010100100000001110100010110101101001100101101101011110 e981aee8888ee9858ce7b89ee981aee8888ee89480e8b5a65be981aee8888ee9858ce7b89ee981aee8888ee89480e8b5a65b5e
UHC 遮?酌縞遮??赦[遮?酌縞遮??赦[^ 1111001110110100001111111110110111001100111110111101011011110011101101000011111100111111110111101111010101011011111100111011010000111111111011011100110011111011110101101111001110110100001111110011111111011110111101010101101101011110 f3b43fedccfbd6f3b43f3fdef55bf3b43fedccfbd6f3b43f3fdef55b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)