What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?у????永??淫?К蒻????????^ 001111111000010010000101001111110011111100111111001111111000100101101001001111110011111110001000111110100011111110000100010010111110010011101000001111110011111100111111001111110011111100111111001111110011111101011110 3f84853f3f3f3f89693f3f88fa3f844be4e83f3f3f3f3f3f3f3f5e
EUC-JP ?у????永??淫?К蒻?????洧??^ 0011111110100111111001010011111100111111001111110011111110110001110010100011111100111111101100001111110000111111101001111010110011101000111010100011111100111111001111110011111100111111100011111100011110110100001111110011111101011110 3fa7e53f3f3f3fb1ca3f3fb0fc3fa7ace8ea3f3f3f3f3f8fc7b43f3f5e
UTF-8 寧у텩溜곕젉永귣쓣淫욅К蒻앸젾溜뗧탞洧뺟봽^ 1110111110100110101010101101000110000011111011011000010110101001111011111010011110001011111010101011001110010101111011001010000010001001111001101011000010111000111010101011011110100011111011001001001110100011111001101011011110101011111011001001101010000101110100001001101011101000100100101011101111101100100101011011100011101100101000001011111011101111101001111000101111101011100101111010011111101101100000111001111011100110101101001010011111101011101110101001111111101011101101001011110101011110 efa6aad183ed85a9efa78beab395eca089e6b0b8eab7a3ec93a3e6b7abec9a85d09ae892bbec95b8eca0beefa78beb97a7ed839ee6b4a7ebba9febb4bd5e
UHC 寧у텩溜곕젉永귣쓣淫욅К蒻앸젾溜뗧탞洧뺟봽^ 11100111101011001010110011100101101101101001110111101010111111101011000011101011101000001000101111100111101101011000001011101011100111011000010011101011111000101001111011100111101011001010110011100101101101101001110111101011101000001011000011101010111111101000101111100111101101011000001011101010111110111001010111100111100101001000010001011110 e7acace5b69deafeb0eba08be7b582eb9d84ebe29ee7acace5b69deba0b0eafe8be7b582eafb95e794845e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
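
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Sketch only: label-to-codec mapping is an assumption, not the page's own code.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC (Unified Hangul Code)
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ^").items():
        print(label, bits, hex_string)
```

Each row pairs the byte sequence written as 8-bit groups with the same bytes in hexadecimal, so the binary column is always four times as long as the hexadecimal column.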