To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^z??????????^zB 00111111001111110011111100111111001111110011111100111111001111110011111100111111010111100111101000111111001111110011111100111111001111110011111100111111001111110011111100111111010111100111101001000010 3f3f3f3f3f3f3f3f3f3f5e7a3f3f3f3f3f3f3f3f3f3f5e7a42
SJIS-WIN 薰滂スシ﨡会スス閠ウ^z薰滂スシ﨡会スス閠ウ^zB 1111101110011110100111111110111110111101101111001111101110100000100010011110111110111101101111011110100010000000101100110101111001111010111110111001111010011111111011111011110110111100111110111010000010001001111011111011110110111101111010001000000010110011010111100111101001000010 fb9e9fefbdbcfba089efbdbde880b35e7afb9e9fefbdbcfba089efbdbde880b35e7a42
EUC-JP ?滂スシ?会スス閠ウ^z?滂スシ?会スス閠ウ^zB 0011111111011110111100011000111010111101100011101011110000111111101100101111000110001110101111011000111010111101111011111110000010001110101100110101111001111010001111111101111011110001100011101011110110001110101111000011111110110010111100011000111010111101100011101011110111101111111000001000111010110011010111100111101001000010 3fdef18ebd8ebc3fb2f18ebd8ebdefe08eb35e7a3fdef18ebd8ebc3fb2f18ebd8ebdefe08eb35e7a42
UTF-8 薰滂スシ﨡会スス閠ウ^z薰滂スシ﨡会スス閠ウ^zB 1110100010010110101100001110011010111011100000101110111110111101101111011110111110111101101111001110111110101000101000011110010010111100100110101110111110111101101111011110111110111101101111011110100110010110101000001110111110111101101100110101111001111010111010001001011010110000111001101011101110000010111011111011110110111101111011111011110110111100111011111010100010100001111001001011110010011010111011111011110110111101111011111011110110111101111010011001011010100000111011111011110110110011010111100111101001000010 e896b0e6bb82efbdbdefbdbcefa8a1e4bc9aefbdbdefbdbde996a0efbdb35e7ae896b0e6bb82efbdbdefbdbcefa8a1e4bc9aefbdbdefbdbde996a0efbdb35e7a42
UHC 薰滂????????^z薰滂????????^zB 1111110110111001110110111011010100111111001111110011111100111111001111110011111100111111001111110101111001111010111111011011100111011011101101010011111100111111001111110011111100111111001111110011111100111111010111100111101001000010 fdb9dbb53f3f3f3f3f3f3f3f5e7afdb9dbb53f3f3f3f3f3f3f3f5e7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)