To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??擁??????≪???????癰??? 1000100101000111001111110011111110010111011010010011111100111111001111110011111100111111001111111000000111100001001111110011111100111111001111110011111100111111001111111110000110011110001111110011111100111111 89473f3f97693f3f3f3f3f3f81e13f3f3f3f3f3f3fe19e3f3f3f
EUC-JP 烏??擁??????≪???????癰??? 1011000110101000001111110011111111001101110010100011111100111111001111110011111100111111001111111010001011100011001111110011111100111111001111110011111100111111001111111110000111111110001111110011111100111111 b1a83f3fcdca3f3f3f3f3f3fa2e33f3f3f3f3f3f3fe1fe3f3f3f
UTF-8 烏녿젻擁얠눊溜⑸쓷溜≪늻溜긴펿溜뺣젧癰꾨퉬溜 111001111000001110001111111010111000010110111111111011001010000010111011111001101001001110000001111011001001011010100000111010111000100010001010111011111010011110001011111000101001000110111000111011001001001110110111111011111010011110001011111000101000100110101010111010111000101010111011111011111010011110001011111010101011100010110100111011011000111010111111111011111010011110001011111010111011101010100011111011001010000010100111111001111001100110110000111010101011111010101000111011011000100110101100111011111010011110001011 e7838feb85bfeca0bbe69381ec96a0eb888aefa78be291b8ec93b7efa78be289aaeb8abbefa78beab8b4ed8ebfefa78bebbaa3eca0a7e799b0eabea8ed89acefa78b
UHC 烏녿젻擁얠눊溜⑸쓷溜≪늻溜긴펿溜뺣젧癰꾨퉬溜 1110100010100001100001101110101110100000101011101110100010110110101111101110110010000111101010001110101011111110101010011110101110011101100101001110101011111110101000011110110010001000100001001110101011111110101100011110010010111100100011101110101011111110100101011110101110100000100111111110100010111001100001001110101110111001100001001110101011111110 e8a186eba0aee8b6beec87a8eafea9eb9d94eafea1ec8884eafeb1e4bc8eeafe95eba09fe8b984ebb984eafe

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
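
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from the page's converter for a few vendor-specific characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Charset label (as shown in the table) -> Python codec name.
# These pairings are assumed equivalents, not confirmed by the page.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode `text` in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("\u3042").items():  # HIRAGANA A
        print(label, bits, hexed)
```

For example, `encode_row("\u3042", "utf-8")` yields the hex string `e38182`, while the same character in ISO-8859-1 comes out as `3f` because that charset has no hiragana.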