To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 雋ゆシ夲スセ蜥丞妺雋ゆシ夲スセ蓿乗初B 11101000101100101000001011100100101111001001101011101111101111011011111011100101100100101000111111100101111110101010010111101000101100101000001011100100101111001001101011101111101111011011111011100100111100001000111111100110100011111000100101000010 e8b282e4bc9aefbdbee5928fe5faa5e8b282e4bc9aefbdbee4f08fe68f8942
EUC-JP 雋ゆシ夲スセ蜥丞妺雋ゆシ夲スセ蓿乗初B 1111000010110100101001001110011010001110101111001101010011110001100011101011110110001110101111101110100111110010101111101110011110001111101110011011011111110000101101001010010011100110100011101011110011010100111100011000111010111101100011101011111011101000111100101011111011101000101111011110100101000010 f0b4a4e68ebcd4f18ebd8ebee9f2bee78fb9b7f0b4a4e68ebcd4f18ebd8ebee8f2bee8bde942
UTF-8 雋ゆシ夲スセ蜥丞妺雋ゆシ夲スセ蓿乗初B 11101001100110111000101111100011100000101000011011101111101111011011110011100101101001001011001011101111101111011011110111101111101111011011111011101000100111001010010111100100101110001001111011100101101001101011101011101001100110111000101111100011100000101000011011101111101111011011110011100101101001001011001011101111101111011011110111101111101111011011111011101000100100111011111111100100101110011001011111100101100010001001110101000010 e99b8be38286efbdbce5a4b2efbdbdefbdbee89ca5e4b89ee5a6bae99b8be38286efbdbce5a4b2efbdbdefbdbee893bfe4b997e5889d42
UHC 雋ゆ?????丞?雋ゆ??????初B 11110001111001101010101011100110001111110011111100111111001111110011111111100011101010100011111111110001111001101010101011100110001111110011111100111111001111110011111100111111111101001111100001000010 f1e6aae63f3f3f3f3fe3aa3ff1e6aae63f3f3f3f3f3ff4f842

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
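
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The codec names are Python's own. Mapping UHC to `cp949` is an assumption, and Python's `euc_jp` codec may differ from the EUC-JP variant used by the page for some characters. Replacing unencodable characters with "?" is inferred from the `3f` bytes in the ISO-8859-1 and UHC rows. The function name `encode_all` is hypothetical.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",    # may differ from the page's EUC-JP variant
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC taken to be Windows code page 949
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    # Hiragana "yu" (U+3086) followed by the ASCII letter "B".
    for label, (bits, hex_string) in encode_all("\u3086B").items():
        print(label, bits, hex_string)
```

The sample input reuses two characters that appear in the table, so the per-character bytes can be checked against it: `e38286` in the UTF-8 row, `a4e6` in the EUC-JP row, `aae6` in the UHC row, and a trailing `42` for "B" in every row.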