What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???妖??妖??伊??珥??懿??溢μ?B 0011111100111111001111111001011101100100001111110011111110010111011001000011111100111111100010001100100100111111001111111110000011100000001111110011111110011100111100100011111100111111100010001110110010000011110010100011111101000010 3f3f3f97643f3f97643f3f88c93f3fe0e03f3f9cf23f3f88ec83ca3f42
EUC-JP ???妖??妖??伊??珥??懿??溢μ?B 0011111100111111001111111100110111000101001111110011111111001101110001010011111100111111101100001100101100111111001111111110000011100010001111110011111111011000111101000011111100111111101100001110111010100110110011000011111101000010 3f3f3fcdc53f3fcdc53f3fb0cb3f3fe0e23f3fd8f43f3fb0eea6cc3f42
UTF-8 琉깆큹妖껋븬妖껎맓伊껊ㅁ珥덆룂懿뺡룂溢μ쪉B 111011111010011110001100111010101011100110000110111011011000000110111001111001011010011010010110111010101011101110001011111010111011100010101100111001011010011010010110111010101011101110001110111010111010011110010011111001001011110010001010111010101011101110001010111000111000010110000001111001111000111110100101111010111000110110000110111010111010001110000010111001101000011110111111111010111011101010100001111010111010001110000010111001101011101010100010110011101011110011101100101010101000100101000010 efa78ceab986ed81b9e5a696eabb8bebb8ace5a696eabb8eeba793e4bc8aeabb8ae38581e78fa5eb8d86eba382e687bfebbaa1eba382e6baa2cebcecaa8942
UHC 琉깆큹妖껋븬妖껎맓伊껊ㅁ珥덆룂懿뺡룂溢μ쪉B 11101011101001001011000111101100101101001000100011101000111011011000001111101100100101011001010111101000111011011000001111101101100100001010010111101100101001011000001111101011101001001011000111101100101101001000100011101001100011111000001111101011111100111001010111101001100011111000001111101100111011101010010111101100101001011000001101000010 eba4b1ecb488e8ed83ec9595e8ed83ed90a5eca583eba4b1ecb488e98f83ebf395e98f83eceea5eca58342

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
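
The kind of conversion shown in the table above can be approximated in a few lines of Python. This is a minimal sketch, not the tool's actual implementation. The helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Charset labels follow the table; the Python codec names on the right
# are assumptions about which codec corresponds to each label.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("妖B").items():
        print(label, bits, hex_string)
```

For the input `妖B`, the hexadecimal column should contain `3f42` for ISO-8859-1 (the kanji is not representable), `976442` for SJIS-WIN, `cdc542` for EUC-JP, and `e5a69642` for UTF-8, matching the byte sequences for `妖` and `B` visible in the table.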