What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 瓦わ?曜ら?節①?瓦わ?曜ら?節③?^ 10001010101000101000001011101101001111111001011101101010100000101110011100111111100100001101111110000111010000000011111110001010101000101000001011101101001111111001011101101010100000101110011100111111100100001101111110000111010000100011111101011110 8aa282ed3f976a82e73f90df87403f8aa282ed3f976a82e73f90df87423f5e
EUC-JP 瓦わ?曜ら?節??瓦わ?曜ら?節??^ 1011010010100100101001001110111100111111110011011100101110100100111010010011111111000000111000010011111100111111101101001010010010100100111011110011111111001101110010111010010011101001001111111100000011100001001111110011111101011110 b4a4a4ef3fcdcba4e93fc0e13f3fb4a4a4ef3fcdcba4e93fc0e13f3f5e
UTF-8 瓦わ쉘曜ら줁節①쵟瓦わ쉘曜ら줁節③죳^ 11100111100100111010011011100011100000101000111111101100100010011001100011100110100110111001110011100011100000101000100111101100101001001000000111100111101011111000000011100010100100011010000011101100101101011001111111100111100100111010011011100011100000101000111111101100100010011001100011100110100110111001110011100011100000101000100111101100101001001000000111100111101011111000000011100010100100011010001011101100101000111011001101011110 e793a6e3828fec8998e69b9ce38289eca481e7af80e291a0ecb59fe793a6e3828fec8998e69b9ce38289eca481e7af80e291a2eca3b35e
UHC 瓦わ쉘曜ら줁節①쵟瓦わ쉘曜ら줁節③죳^ 11101000101111111010101011101111101111011010100111101000111110001010101011101001101000011001100011101111101111011010100011100111101011001010000011101000101111111010101011101111101111011010100111101000111110001010101011101001101000011001100011101111101111011010100011101001101000011000111001011110 e8bfaaefbda9e8f8aae9a198efbda8e7aca0e8bfaaefbda9e8f8aae9a198efbda8e9a18e5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
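
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_bits` and the `CHARSETS` mapping are hypothetical, and the mapping of the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table does in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "瓦わ^"  # hypothetical sample input
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, binary, hexa)
```

Using `errors="replace"` keeps the output aligned across charsets: every input character yields some bytes, so rows can be compared side by side even when a charset lacks the character.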