To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??泣?㎝楡??語⑤?誼??怨??額??純 1110000110011111001111110011111110001011100000110011111110000111011100001001111010111110001111110011111110001100111010101000011101000100001111111000101101100010001111110011111110001001100001010011111100111111100010100111101000111111001111111000111110000011 e19f3f3f8b833f87709ebe3f3f8cea87443f8b623f3f89853f3f8a7a3f3f8f83
EUC-JP 癲??泣??楡??語??誼??怨??額??純 111000101010000100111111001111111011010111100011001111110011111111011100110000000011111100111111101110001110110000111111001111111011010111000011001111110011111110110001111001010011111100111111101100111101101100111111001111111011110111100011 e2a13f3fb5e33f3fdcc03f3fb8ec3f3fb5c33f3fb1e53f3fb3db3f3fbde3
UTF-8 癲⑸뜄泣쒙㎝楡녹춷語⑤벡誼룟슫怨몃듌額됰돍純 111001111001100110110010111000101001000110111000111010111001110010000100111001101011001110100011111011001001001010011001111000111000111010011101111001101010010110100001111010111000010110111001111011001011011010110111111010001010101010011110111000101001000110100100111010111011001010100001111010001010101010111100111010111010001110011111111011001000101010101011111001101000000010101000111010111010101010000011111010111001001110001100111010011010000110001101111010111001000010110000111010111000111110001101111001111011010010010100 e799b2e291b8eb9c84e6b3a3ec9299e38e9de6a5a1eb85b9ecb6b7e8aa9ee291a4ebb2a1e8aabceba39fec8aabe680a8ebaa83eb938ce9a18deb90b0eb8f8de7b494
UHC 癲⑸뜄泣쒙㎝楡녹춷語⑤벡誼룟슫怨몃듌額됰돍純 1110111110100110101010011110101110001101100010001110101111101000100111001110111110100111101011111110101011111000101100111110110010101101100100111110010111011110101010001110101110111010101001001110101111111110101101111110010110011010101101001110101010110011101110001110101110001010101111111110010011111110100010011110101110001001100110111110001011101101 efa6a9eb8d88ebe89cefa7afeaf8b3ecad93e5dea8ebbaa4ebfeb7e59ab4eab3b8eb8abfe4fe89eb899be2ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)