To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???壓?????佯??濡??臆?????鶯 001111110011111100111111100110101101100000111111001111110011111100111111001111111001100011010001001111110011111110010100010001110011111100111111100010011011000000111111001111110011111100111111001111111110100111110010 3f3f3f9ad83f3f3f3f3f98d13f3f94473f3f89b03f3f3f3f3fe9f2
EUC-JP ???壓?????佯??濡??臆?????鶯 001111110011111100111111110101001101101000111111001111110011111100111111001111111101000011010011001111110011111111000111101010000011111100111111101100101011001000111111001111110011111100111111001111111111001011110100 3f3f3fd4da3f3f3f3f3fd0d33f3fc7a83f3fb2b23f3f3f3f3ff2f4
UTF-8 凉붾젨壓꾧끝溜깅젪佯꾨젧濡덈젵臆뚮젎凉붾젨鶯 111011111010010110111001111010111011011010111110111011001010000010101000111001011010001110010011111010101011111010100111111010111000000110011101111011111010011110001011111010101011100110000101111011001010000010101010111001001011110110101111111010101011111010101000111011001010000010100111111001101011111110100001111010111000110110001000111011001010000010110101111010001000011110000110111010111001101010101110111011001010000010001110111011111010010110111001111010111011011010111110111011001010000010101000111010011011011010101111 efa5b9ebb6beeca0a8e5a393eabea7eb819defa78beab985eca0aae4bdafeabea8eca0a7e6bfa1eb8d88eca0b5e88786eb9aaeeca08eefa5b9ebb6beeca0a8e9b6af
UHC 凉붾젨壓꾧끝溜깅젪佯꾨젧濡덈젵臆뚮젎凉붾젨鶯 1110010110111100100101001110101110100000101000001110010011100010100001001110101010110011101000011110101011111110101100011110101110100000101000101110010110111010100001001110101110100000100111111110101110100001100010001110101110100000101010011110010111100110100011001110101110100000100011111110010110111100100101001110101110100000101000001110010110100011 e5bc94eba0a0e4e284eab3a1eafeb1eba0a2e5ba84eba09feba188eba0a9e5e68ceba08fe5bc94eba0a0e5a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)