To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ûM¨ûž­ûM¨çŸ£n}ûM¨ûž­ûM¨çŸ£n{^ 1111101101001101101010001111101110011110101011011111101101001101101010001110011110011111101000110110111001111101111110110100110110101000111110111001111010101101111110110100110110101000111001111001111110100011011011100111101101011110 fb4da8fb9eadfb4da8e79fa36e7dfb4da8fb9eadfb4da8e79fa36e7b5e
SJIS-WIN ?M¨????M¨??£n}?M¨????M¨??£n{^ 0011111101001101100000010100111000111111001111110011111100111111010011011000000101001110001111110011111110000001100100100110111001111101001111110100110110000001010011100011111100111111001111110011111101001101100000010100111000111111001111111000000110010010011011100111101101011110 3f4d814e3f3f3f3f4d814e3f3f81926e7d3f4d814e3f3f3f3f4d814e3f3f81926e7b5e
EUC-JP ûM¨û??ûM¨ç?£n}ûM¨û??ûM¨ç?£n{^ 100011111010101111100101010011011010000110101111100011111010101111100101001111110011111110001111101010111110010101001101101000011010111110001111101010111010111000111111101000011111001001101110011111011000111110101011111001010100110110100001101011111000111110101011111001010011111100111111100011111010101111100101010011011010000110101111100011111010101110101110001111111010000111110010011011100111101101011110 8fabe54da1af8fabe53f3f8fabe54da1af8fabae3fa1f26e7d8fabe54da1af8fabe53f3f8fabe54da1af8fabae3fa1f26e7b5e
UTF-8 ûM¨ûž­ûM¨çŸ£n}ûM¨ûž­ûM¨çŸ£n{^ 11000011101110110100110111000010101010001100001110111011110000101001111011000010101011011100001110111011010011011100001010101000110000111010011111000010100111111100001010100011011011100111110111000011101110110100110111000010101010001100001110111011110000101001111011000010101011011100001110111011010011011100001010101000110000111010011111000010100111111100001010100011011011100111101101011110 c3bb4dc2a8c3bbc29ec2adc3bb4dc2a8c3a7c29fc2a36e7dc3bb4dc2a8c3bbc29ec2adc3bb4dc2a8c3a7c29fc2a36e7b5e
UHC ?M¨??­?M¨???n}?M¨??­?M¨???n{^ 0011111101001101101000011010011100111111001111111010000110101001001111110100110110100001101001110011111100111111001111110110111001111101001111110100110110100001101001110011111100111111101000011010100100111111010011011010000110100111001111110011111100111111011011100111101101011110 3f4da1a73f3fa1a93f4da1a73f3f3f6e7d3f4da1a73f3fa1a93f4da1a73f3f3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)