What bit string does a character encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????ы?姨?????????ы?姨?????^ 0011111100111111001111110011111110000100100011010011111110011011010010000011111100111111001111110011111100111111001111110011111100111111001111111000010010001101001111111001101101001000001111110011111100111111001111110011111101011110 3f3f3f3f848d3f9b483f3f3f3f3f3f3f3f3f848d3f9b483f3f3f3f3f5e
EUC-JP ????ы?姨?????????ы?姨?????^ 0011111100111111001111110011111110100111111011010011111111010101101010010011111100111111001111110011111100111111001111110011111100111111001111111010011111101101001111111101010110101001001111110011111100111111001111110011111101011110 3f3f3f3fa7ed3fd5a93f3f3f3f3f3f3f3f3fa7ed3fd5a93f3f3f3f3f5e
UTF-8 淋륂쉲泥ы쉹姨랁슗淋믪찈淋륂쉲泥ы쉹姨랁슗淋믪찈^ 1110111110100111101101011110101110100101100000101110110010001001101100101110111110100111101000111101000110001011111011001000100110111001111001011010011110101000111010111001111010000001111011001000101010010111111011111010011110110101111010111010111110101010111011001011000010001000111011111010011110110101111010111010010110000010111011001000100110110010111011111010011110100011110100011000101111101100100010011011100111100101101001111010100011101011100111101000000111101100100010101001011111101111101001111011010111101011101011111010101011101100101100001000100001011110 efa7b5eba582ec89b2efa7a3d18bec89b9e5a7a8eb9e81ec8a97efa7b5ebafaaecb088efa7b5eba582ec89b2efa7a3d18bec89b9e5a7a8eb9e81ec8a97efa7b5ebafaaecb0885e
UHC 淋륂쉲泥ы쉹姨랁슗淋믪찈淋륂쉲泥ы쉹姨랁슗淋믪찈^ 11101100111110001000111111101101100110101000100111101100101100101010110011101101100110101000111111101100101010011000110111101101100110101010011011101100111110001001001011101100101010011000110011101100111110001000111111101101100110101000100111101100101100101010110011101101100110101000111111101100101010011000110111101101100110101010011011101100111110001001001011101100101010011000110001011110 ecf88fed9a89ecb2aced9a8feca98ded9aa6ecf892eca98cecf88fed9a89ecb2aced9a8feca98ded9aa6ecf892eca98c5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
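
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: it assumes that Python's standard codecs `cp932` and `cp949` correspond to SJIS-Win and UHC, and that characters a charset cannot represent are replaced by `?` (byte 3f), as in the table above. The names `CHARSETS`, `to_bit_strings`, and `conversion_table` are hypothetical.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent become '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def conversion_table(text):
    """Return one (label, binary, hexadecimal) row per charset."""
    return [(label, *to_bit_strings(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("ы^"):
        print(label, binary, hexadecimal)
```

For the input `ы^`, the EUC-JP row should end in `5e` for the caret, preceded by the two bytes `a7ed` that the table above also shows for `ы`.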