To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????m}v??????????m}vB 001111110011111100111111001111110011111100111111001111110011111100111111001111110110110101111101011101100011111100111111001111110011111100111111001111110011111100111111001111110011111101101101011111010111011001000010 3f3f3f3f3f3f3f3f3f3f6d7d763f3f3f3f3f3f3f3f3f3f6d7d7642
SJIS-WIN 趙丈ケ樞゚占クエ雎「m}v趙丈ケ樞゚占クエ雎「m}vB 11100110111000101000111111100100101110011001111011100010110111111001000011101000101110001011010011101000101100011010001001101101011111010111011011100110111000101000111111100100101110011001111011100010110111111001000011101000101110001011010011101000101100011010001001101101011111010111011001000010 e6e28fe4b99ee2df90e8b8b4e8b1a26d7d76e6e28fe4b99ee2df90e8b8b4e8b1a26d7d7642
EUC-JP 趙丈ケ樞゚占クエ雎「m}v趙丈ケ樞゚占クエ雎「m}vB 1110110011100100101111101110011010001110101110011101110011100100100011101101111111000000111010101000111010111000100011101011010011110000101100111000111010100010011011010111110101110110111011001110010010111110111001101000111010111001110111001110010010001110110111111100000011101010100011101011100010001110101101001111000010110011100011101010001001101101011111010111011001000010 ece4bee68eb9dce48edfc0ea8eb88eb4f0b38ea26d7d76ece4bee68eb9dce48edfc0ea8eb88eb4f0b38ea26d7d7642
UTF-8 趙丈ケ樞゚占クエ雎「m}v趙丈ケ樞゚占クエ雎「m}vB 11101000101101101001100111100100101110001000100011101111101111011011100111100110101010001001111011101111101111101001111111100101100011011010000011101111101111011011100011101111101111011011010011101001100110111000111011101111101111011010001001101101011111010111011011101000101101101001100111100100101110001000100011101111101111011011100111100110101010001001111011101111101111101001111111100101100011011010000011101111101111011011100011101111101111011011010011101001100110111000111011101111101111011010001001101101011111010111011001000010 e8b699e4b888efbdb9e6a89eefbe9fe58da0efbdb8efbdb4e99b8eefbda26d7d76e8b699e4b888efbdb9e6a89eefbe9fe58da0efbdb8efbdb4e99b8eefbda26d7d7642
UHC 趙丈?樞?占??雎?m}v趙丈?樞?占??雎?m}vB 11110000111000011110110111011011001111111111010111010010001111111110111110111111001111110011111111101110110100010011111101101101011111010111011011110000111000011110110111011011001111111111010111010010001111111110111110111111001111110011111111101110110100010011111101101101011111010111011001000010 f0e1eddb3ff5d23fefbf3f3feed13f6d7d76f0e1eddb3ff5d23fefbf3f3feed13f6d7d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
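
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with `?` (byte `3f`), as the ISO-8859-1 row suggests. The `CODECS` mapping and the `encode_table` function are hypothetical names introduced for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("丈B"):
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one. Results for edge cases (for example, vendor-specific extension characters) may differ from the table if the converter uses different mapping tables than Python does.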