To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?油??循?????????ヨ??ル? 111001001110100010000010111010100011111110010110111110110011111100111111100011110111101000111111001111110011111100111111001111110011111100111111001111110011111110000011100010000011111100111111100000111000101100111111 e4e882ea3f96fb3f3f8f7a3f3f3f3f3f3f3f3f3f83883f3f838b3f
EUC-JP 蒻れ?油??循?????洹??縕ヨ??ル? 11101000111010101010010011101100001111111100110011111101001111110011111110111101110110110011111100111111001111110011111100111111100011111100011110111010001111110011111110001111110101001100001010100101111010000011111100111111101001011110101100111111 e8eaa4ec3fccfd3f3fbddb3f3f3f3f3f8fc7ba3f3f8fd4c2a5e83f3fa5eb3f
UTF-8 蒻れ슜油꾣쨫循녿짎捻믡겈洹앹뿉縕ヨ녂戮ル릅 111010001001001010111011111000111000001010001100111011001000101010011100111001101011001010111001111010101011111010100011111011001010100010101011111001011011111010101010111010111000010110111111111011001010011110001110111011111010011010100100111010111010111110100001111010101011001010001000111001101011010010111001111011001001010110111001111010111011111110001001111001111011100010010101111000111000001110101000111010111000010110000010111011111010011110010010111000111000001110101011111010111010011010000101 e892bbe3828cec8a9ce6b2b9eabea3eca8abe5beaaeb85bfeca78eefa6a4ebafa1eab288e6b4b9ec95b9ebbf89e7b895e383a8eb8582efa792e383abeba685
UHC 蒻れ슜油꾣쨫循녿짎捻믡겈洹앹뿉縕ヨ녂戮ル릅 111001011011011010101010111011001001101010101001111010101111101010000100111001101010010010000101111000101110000010000110111010111010001110011010111001101111011110010010111000111000000110100101111010101011011110011101111011001001011110010000111010001011001010101011111010001000011010111010111010111011110110101011111010111011100010101000 e5b6aaec9aa9eafa84e6a485e2e086eba39ae6f792e381a5eab79dec9790e8b2abe886baebbdabebb8a8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)