To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????音??沃??竊??喩?????肉 001111110011111100111111001111110011111100111111100010011011100100111111001111111001011110000000001111110011111111100010100001100011111100111111100110100110011100111111001111110011111100111111001111111001001111110111 3f3f3f3f3f3f89b93f3f97803f3fe2863f3f9a673f3f3f3f3f93f7
EUC-JP ???絪??音??沃??竊??喩?????肉 0011111100111111001111111000111111010011111011000011111100111111101100101011101100111111001111111100110111100000001111110011111111100011111001100011111100111111110100111100100000111111001111110011111100111111001111111100011011111001 3f3f3f8fd3ec3f3fb2bb3f3fcde03f3fe3e63f3fd3c83f3f3f3f3fc6f9
UTF-8 列룸쓷絪득씭音곤폇沃쇰뜄竊뗩뼸喩롪굻列룔꺆肉 111011111010011010011100111010111010001110111000111011001001001110110111111001111011010110101010111010111001001110011101111011001001010010101101111010011001111110110011111010101011001110100100111011011000111110000111111001101011001010000011111011001000011110110000111010111001110010000100111001111010101110001010111010111001011110101001111010111011110010111000111001011001011010101001111010111010000110101010111010101011010110111011111011111010011010011100111010111010001110010100111010101011101010000110111010001000001010001001 efa69ceba3b8ec93b7e7b5aaeb939dec94ade99fb3eab3a4ed8f87e6b283ec87b0eb9c84e7ab8aeb97a9ebbcb8e596a9eba1aaeab5bbefa69ceba394eaba86e88289
UHC 列룸쓷絪득씭音곤폇沃쇰뜄竊뗩뼸喩롪굻列룔꺆肉 1110011011101010101101111110101110011101100101001110110011011111101101011110011010011101101111101110101111100101101100001110111110111100100101001110100010101010101111001110101110001101100010001110111110111100100010111110100110010110101110111110101011100111100011101110101010110001101111111110011011101010101101111110001110000011101011011110101110111111 e6eab7eb9d94ecdfb5e69dbeebe5b0efbc94e8aabceb8d88efbc8be996bbeae78eeab1bfe6eab7e383adebbf

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
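
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name encode_table and the CHARSETS mapping are hypothetical, the cp932 codec follows the SJIS-Win = CP932 note above, and mapping UHC to Python's cp949 codec is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Map the labels used in the table to Python codec names.
# "UHC" -> "cp949" is an assumption; the others follow common usage.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("音"):
        print(label, bits, hexstr)
```

For the single character 音 this yields the same byte values that appear inside the table rows: 89b9 for SJIS-WIN, b2bb for EUC-JP, e99fb3 for UTF-8, and 3f for ISO-8859-1, which has no code for it.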