To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ぱ??癲ル?爰??誘Π??蹂≪?俑??乙 1110000110011111100000101100111100111111001111111110000110011111100000111000101100111111111000001010011100111111001111111001011101010101100000111010111000111111001111111110011011111000100000011110000100111111100110001101101000111111001111111000100110110011 e19f82cf3f3fe19f838b3fe0a73f3f975583ae3f3fe6f881e13f98da3f3f89b3
EUC-JP 癲ぱ??癲ル?爰??誘Π??蹂≪?俑??乙 1110001010100001101001001101000100111111001111111110001010100001101001011110101100111111111000001010100100111111001111111100110110110110101001101011000000111111001111111110110011111010101000101110001100111111110100001101110000111111001111111011001010110101 e2a1a4d13f3fe2a1a5eb3fe0a93f3fcdb6a6b03f3fecfaa2e33fd0dc3f3fb2b5
UTF-8 癲ぱ됯괭癲ル쵈爰뚳쭓誘Π싮윀蹂≪떫俑앸돆乙 1110011110011001101100101110001110000001101100011110101110010000101011111110101010110100101011011110011110011001101100101110001110000011101010111110110010110101100010001110011110001000101100001110101110011010101100111110110010101101100100111110100010101010100110001100111010100000111011001000101110101110111011001001110010000000111010001011100110000010111000101000100110101010111010111001011010101011111001001011111110010001111011001001010110111000111010111000111110000110111001001011100110011001 e799b2e381b1eb90afeab4ade799b2e383abecb588e788b0eb9ab3ecad93e8aa98cea0ec8baeec9c80e8b982e289aaeb96abe4bf91ec95b8eb8f86e4b999
UHC 癲ぱ됯괭癲ル쵈爰뚳쭓誘Π싮윀蹂≪떫俑앸돆乙 111011111010011010101010110100011000100111101010101100011010101011101111101001101010101111101011101011001000101011101010101110101000110011101111101001111000101111101011101011111010010111010000100110101110100110011111100010111110101110110011101000011110110010110110101101011110100110110101100111011110101110001001100101111110101111100000 efa6aad189eab1aaefa6abebac8aeaba8cefa78bebafa5d09ae99f8bebb3a1ecb6b5e9b59deb8997ebe0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)