What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遏齢瞳贒オ遏齢お薰ィ遏齢瞳贒オ遏齢お薰ィB 11100111100111111001011111101110100100111011010111111011101011111011010111100111100111111001011111101110100000101010100011111011100111101010100011100111100111111001011111101110100100111011010111111011101011111011010111100111100111111001011111101110100000101010100011111011100111101010100001000010 e79f97ee93b5fbafb5e79f97ee82a8fb9ea8e79f97ee93b5fbafb5e79f97ee82a8fb9ea842
EUC-JP 遏齢瞳贒オ遏齢お?ィ遏齢瞳贒オ遏齢お?ィB 1110111010100001110011101111000011000110101101111000111111011111110000111000111010110101111011101010000111001110111100001010010010101010001111111000111010101000111011101010000111001110111100001100011010110111100011111101111111000011100011101011010111101110101000011100111011110000101001001010101000111111100011101010100001000010 eea1cef0c6b78fdfc38eb5eea1cef0a4aa3f8ea8eea1cef0c6b78fdfc38eb5eea1cef0a4aa3f8ea842
UTF-8 遏齢瞳贒オ遏齢お薰ィ遏齢瞳贒オ遏齢お薰ィB 11101001100000011000111111101001101111011010001011100111100111101011001111101000101101001001001011101111101111011011010111101001100000011000111111101001101111011010001011100011100000011000101011101000100101101011000011101111101111011010100011101001100000011000111111101001101111011010001011100111100111101011001111101000101101001001001011101111101111011011010111101001100000011000111111101001101111011010001011100011100000011000101011101000100101101011000011101111101111011010100001000010 e9818fe9bda2e79eb3e8b492efbdb5e9818fe9bda2e3818ae896b0efbda8e9818fe9bda2e79eb3e8b492efbdb5e9818fe9bda2e3818ae896b0efbda842
UHC ??瞳????お薰???瞳????お薰?B 001111110011111111010100110110100011111100111111001111110011111110101010101010101111110110111001001111110011111100111111110101001101101000111111001111110011111100111111101010101010101011111101101110010011111101000010 3f3fd4da3f3f3f3faaaafdb93f3f3fd4da3f3f3f3faaaafdb93f42

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
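
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` is hypothetical, and the codec names are assumptions: Python's `cp932` is taken as the counterpart of SJIS-win and `cp949` as the counterpart of UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the ISO-8859-1 and UHC rows above.

```python
# Sketch: encode one string in several character sets and show the
# resulting bytes as a binary bit string and as hexadecimal.
# Assumption: these Python codec names stand in for the page's charsets.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("おB"):
        print(name, bits, hexstr)
```

For the input `おB`, the UTF-8 row comes out as `e3818a42` and the cp932 row as `82a842`, matching the `e3818a` and `82a8` byte sequences for お that appear in the UTF-8 and SJIS-WIN rows of the table.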