To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セュ爾セュ者セュ爾治識セュ爾セュ紗樵ィ爾治識B 10111110101011011000111010100010101111101010110110001110110100101011111010101101100011101010001010001110101000011000111010101111101111101010110110001110101000101011111010101101100011101101000110001111101111111010100010001110101000101000111010100001100011101010111101000010 bead8ea2bead8ed2bead8ea28ea18eafbead8ea2bead8ed18fbfa88ea28ea18eaf42
EUC-JP セュ爾セュ者セュ爾治識セュ爾セュ紗樵ィ爾治識B 100011101011111010001110101011011011110010100100100011101011111010001110101011011011110011010100100011101011111010001110101011011011110010100100101111001010001110111100101100011000111010111110100011101010110110111100101001001000111010111110100011101010110110111100110100111011111011000001100011101010100010111100101001001011110010100011101111001011000101000010 8ebe8eadbca48ebe8eadbcd48ebe8eadbca4bca3bcb18ebe8eadbca48ebe8eadbcd3bec18ea8bca4bca3bcb142
UTF-8 セュ爾セュ者セュ爾治識セュ爾セュ紗樵ィ爾治識B 11101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110111101000100000001000010111101111101111011011111011101111101111011010110111100111100010001011111011100110101100101011101111101000101011011001100011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110111100111101101001001011111100110101010001011010111101111101111011010100011100111100010001011111011100110101100101011101111101000101011011001100001000010 efbdbeefbdade788beefbdbeefbdade88085efbdbeefbdade788bee6b2bbe8ad98efbdbeefbdade788beefbdbeefbdade7b497e6a8b5efbda8e788bee6b2bbe8ad9842
UHC ??爾??者??爾治識??爾??紗樵?爾治識B 00111111001111111110110010110011001111110011111111101101101110100011111100111111111011001011001111110110101111011110001111011011001111110011111111101100101100110011111100111111110111101110100111110101101000110011111111101100101100111111011010111101111000111101101101000010 3f3fecb33f3fedba3f3fecb3f6bde3db3f3fecb33f3fdee9f5a33fecb3f6bde3db42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)