What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 搖??節??褥?????玉??狎?????^ 100111011000101000111111001111111001000011011111001111110011111111100101111100010011111100111111001111110011111100111111100010111100101000111111001111111110000010111110001111110011111100111111001111110011111101011110 9d8a3f3f90df3f3fe5f13f3f3f3f3f8bca3f3fe0be3f3f3f3f3f5e
EUC-JP 搖??節??褥?????玉??狎??旿??^ 1101100111101010001111110011111111000000111000010011111100111111111010101111001100111111001111110011111100111111001111111011011011001100001111110011111111100000110000000011111100111111100011111100000111110100001111110011111101011110 d9ea3f3fc0e13f3feaf33f3f3f3f3fb6cc3f3fe0c03f3f8fc1f43f3f5e
UTF-8 搖얕쵌節삣넼褥⑵쐥嶺묌뼻玉롳슥狎띌낑旿딁븳^ 11100110100100001001011011101100100101101001010111101100101101011000110011100111101011111000000011101100100000101010001111101011100001001011110011101000101001001010010111100010100100011011010111101100100100001010010111101111101001101010101111101011101011001000110011101011101111001011101111100111100011101000100111101011101000011011001111101100100010101010010111100111100010111000111011101011100111011000110011101011100000101001000111100110100101111011111111101011100101001000000111101011101110001011001101011110 e69096ec9695ecb58ce7af80ec82a3eb84bce8a4a5e291b5ec90a5efa6abebac8cebbcbbe78e89eba1b3ec8aa5e78b8eeb9d8ceb8291e697bfeb9481ebb8b35e
UHC 搖얕쵌節삣넼褥⑵쐥嶺묌뼻玉롳슥狎띌낑旿딁븳^ 11101000111101001011111011101000101011001000111011101111101111011011101111100101100001101011011011101001101100111010100111101000100111001000101011100111101011011001000111101001100101101011111011101000101011001000111011101111101111011011101111100100111001001011011011101001101100111010100111100111111110101000101011100111100101011001110001011110 e8f4bee8ac8eefbdbbe586b6e9b3a9e89c8ae7ad91e996bee8ac8eefbdbbe4e4b6e9b3a9e7fa8ae7959c5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
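
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The helper name encode_to_bits and the CHARSETS mapping are hypothetical, and the codec names (cp932 for SJIS-win, cp949 for UHC) are assumed to be Python's closest equivalents; a few mappings may differ from the tool's. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These are assumed equivalents, not the converter's own identifiers.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("搖^", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.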