What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8倚?┨節?????違ゅ?循??塋??? 11100001100111110011111110000010010101111001100011011111001111111000010010110111100100001101111100111111001111110011111100111111001111111000100011100001100000101110001100111111100011110111101000111111001111111001101011001000001111110011111100111111 e19f3f825798df3f84b790df3f3f3f3f3f88e182e33f8f7a3f3f9ac83f3f3f
EUC-JP 癲?8倚?┨節??孼?ħ違ゅ?循??塋??嫄 11100010101000010011111110100011101110001101000011100001001111111010100010111001110000001110000100111111001111111000111110111010110000110011111110001111101010011100010010110000111000111010010011100101001111111011110111011011001111110011111111010100110010100011111100111111100011111011101010100001 e2a13fa3b8d0e13fa8b9c0e13f3f8fbac33f8fa9c4b0e3a4e53fbddb3f3fd4ca3f3f8fbaa1
UTF-8 癲쒕8倚싮┨節뉖쇀孼꾨ħ違ゅ춢循뚯뒠塋딅틶嫄 1110011110011001101100101110110010010010100101011110111110111100100110001110010110000000100110101110110010001011101011101110001010010100101010001110011110101111100000001110101110001001100101101110110010000111100000001110010110101101101111001110101010111110101010001100010010100111111010011000000110010101111000111000001010000101111011001011011010100010111001011011111010101010111010111001101010101111111010111001001010100000111001011010000110001011111010111001010010000101111011011000101110110110111001011010101110000100 e799b2ec9295efbc98e5809aec8baee294a8e7af80eb8996ec8780e5adbceabea8c4a7e98195e38285ecb6a2e5beaaeb9aafeb92a0e5a18beb9485ed8bb6e5ab84
UHC 癲쒕8倚싮┨節뉖쇀孼꾨ħ違ゅ춢循뚯뒠塋딅틶嫄 1110111110100110100111001110101110100011101110001110101111101111100110101110100110100110101110011110111110111101100001111110101110011001101101001110010111101101100001001110101110101001101001001110101011011110101010101110010110101101100000111110001011100000100011001110110010001010100111001110011110101011100010101110101110111010100111011110101010110001 efa69ceba3b8ebef9ae9a6b9efbd87eb99b4e5ed84eba9a4eadeaae5ad83e2e08cec8a9ce7ab8aebba9deab1

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
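
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and Python's codecs may differ from the page's converter for some characters. The helper name `encode_row` is hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches what the ISO-8859-1 row shows.

```python
# Charset labels from the table mapped to Python codec names.
# These pairings are assumed equivalents, not confirmed by the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Characters the charset cannot represent become "?" (0x3f).
    """
    raw = text.encode(CODECS[charset], errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample character.
    for name in CODECS:
        bits, hexed = encode_row("あ", name)
        print(name, bits, hexed)
```

For example, `encode_row("あ", "UTF-8")` yields the three bytes e3 81 82, while the same character in ISO-8859-1 falls back to the single byte 3f because that charset has no hiragana.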