To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????ф?幽??域??幼????????? 0011111100111111001111110011111110000100100001100011111110010111010010000011111100111111100010001110011000111111001111111001011101100011001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f84863f97483f3f88e63f3f97633f3f3f3f3f3f3f3f3f
EUC-JP ????ф?幽??域??幼??洧??孼??? 001111110011111100111111001111111010011111100110001111111100110110101001001111110011111110110000111010000011111100111111110011011100010000111111001111111000111111000111101101000011111100111111100011111011101011000011001111110011111100111111 3f3f3f3fa7e63fcda93f3fb0e83f3fcdc43f3f8fc7b43f3f8fbac33f3f3f
UTF-8 麗몃쓷流ф끽幽꾨룆域밟뫁幼뗦갭洧뷀뜙孼뽏룸룆 1110111110100110100010001110101110101010100000111110110010010011101101111110111110100111100010101101000110000100111010111000000110111101111001011011100110111101111010101011111010101000111010111010001110000110111001011001111110011111111010111011000010011111111010111010101110000001111001011011100110111100111010111001011110100110111010101011000010101101111001101011010010100111111010111011011110000000111010111001110010011001111001011010110110111100111010111011110110001111111010111010001110111000111010111010001110000110 efa688ebaa83ec93b7efa78ad184eb81bde5b9bdeabea8eba386e59f9febb09febab81e5b9bceb97a6eab0ade6b4a7ebb780eb9c99e5adbcebbd8feba3b8eba386
UHC 麗몃쓷流ф끽幽꾨룆域밟뫁幼뗦갭洧뷀뜙孼뽏룸룆 1110011010110000101110001110101110011101100101001110101011111100101011001110011010110011101000111110101011101011100001001110101110001111100001011110011010110100101110011110001010010001101001011110101011101010100010111110011010110000101110001110101011111011100101001110110110001101100111001110010111101101100101101100111010110111111010111000111110000101 e6b0b8eb9d94eafcace6b3a3eaeb84eb8f85e6b4b9e291a5eaea8be6b0b8eafb94ed8d9ce5ed96ceb7eb8f85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)