Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 鳥?悲W^鳥?悲\}v鳥?悲W^鳥?悲\}vB 10010010101110010011111110010100110111110101011101011110100100101011100100111111100101001101111101011100011111010111011010010010101110010011111110010100110111110101011101011110100100101011100100111111100101001101111101011100011111010111011001000010 92b93f94df575e92b93f94df5c7d7692b93f94df575e92b93f94df5c7d7642
EUC-JP 鳥?悲W^鳥?悲\}v鳥?悲W^鳥?悲\}vB 11000100101110110011111111001000111000010101011101011110110001001011101100111111110010001110000101011100011111010111011011000100101110110011111111001000111000010101011101011110110001001011101100111111110010001110000101011100011111010111011001000010 c4bb3fc8e1575ec4bb3fc8e15c7d76c4bb3fc8e1575ec4bb3fc8e15c7d7642
UTF-8 鳥흙悲W^鳥흙悲\}v鳥흙悲W^鳥흙悲\}vB 1110100110110011101001011110110110011101100110011110011010000010101100100101011101011110111010011011001110100101111011011001110110011001111001101000001010110010010111000111110101110110111010011011001110100101111011011001110110011001111001101000001010110010010101110101111011101001101100111010010111101101100111011001100111100110100000101011001001011100011111010111011001000010 e9b3a5ed9d99e682b2575ee9b3a5ed9d99e682b25c7d76e9b3a5ed9d99e682b2575ee9b3a5ed9d99e682b25c7d7642
UHC 鳥흙悲W^鳥흙悲\}v鳥흙悲W^鳥흙悲\}vB 1111000011101000110010001110101111011101111010000101011101011110111100001110100011001000111010111101110111101000010111000111110101110110111100001110100011001000111010111101110111101000010101110101111011110000111010001100100011101011110111011110100001011100011111010111011001000010 f0e8c8ebdde8575ef0e8c8ebdde85c7d76f0e8c8ebdde8575ef0e8c8ebdde85c7d7642

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
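
The conversion shown in the table can be approximated with a short script. This is a minimal sketch, not the page's actual implementation: it assumes that Python's standard codecs (latin-1, cp932, euc_jp, utf-8, cp949) correspond to the charsets listed above, and that characters a charset cannot represent are replaced with "?" (0x3f), as the table rows suggest. The names CHARSETS and encode_report are hypothetical.

```python
# Sketch: encode a string in several charsets and show the resulting
# bytes as a binary bit string and as hexadecimal.
# Assumption: these Python codec names stand in for the page's charsets.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_report(text, charsets=CHARSETS):
    """Return (charset, displayed text, bit string, hex string) per charset."""
    rows = []
    for name, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        shown = data.decode(codec)  # what survives the round trip
        rows.append((name, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, shown, bits, hexs in encode_report("鳥흙悲W^"):
        print(name, shown, bits, hexs)
```

Decoding the bytes again in the same charset shows which characters were lost: in ISO-8859-1 all three CJK characters come back as "?", while the Japanese charsets lose only the Hangul syllable.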