What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 畑??乳??淫?????悠?Ⅹ源?????B 10010100101010000011111100111111100100111111101100111111001111111000100011111010001111110011111100111111001111110011111110010111010010010011111110000111010111011000110010111001001111110011111100111111001111110011111101000010 94a83f3f93fb3f3f88fa3f3f3f3f3f97493f875d8cb93f3f3f3f3f42
EUC-JP 畑??乳??淫??孼??悠??源??孼??B 11001000101010100011111100111111110001101111110100111111001111111011000011111100001111110011111110001111101110101100001100111111001111111100110110101010001111110011111110111000101110110011111100111111100011111011101011000011001111110011111101000010 c8aa3f3fc6fd3f3fb0fc3f3f8fbac33f3fcdaa3f3fb8bb3f3f8fbac33f3f42
UTF-8 畑밴퉭乳득룚淫볛뀮孼꾨챷悠븝Ⅹ源낅꺏孼뽰걢B 11100111100101011001000111101011101100001011010011101101100010011010110111100100101110011011001111101011100100111001110111101011101000111001101011100110101101111010101111101011101100111001101111101011100000001010111011100101101011011011110011101010101111101010100011101100101100011011011111100110100000101010000011101011101110001001110111100010100001011010100111100110101110101001000011101011100000101000010111101010101110101000111111100101101011011011110011101011101111011011000011101010101100011010001001000010 e79591ebb0b4ed89ade4b9b3eb939deba39ae6b7abebb39beb80aee5adbceabea8ecb1b7e682a0ebb89de285a9e6ba90eb8285eaba8fe5adbcebbdb0eab1a242
UHC 畑밴퉭乳득룚淫볛뀮孼꾨챷悠븝Ⅹ源낅꺏孼뽰걢B 11101111101001011011100111101010101110011000010111101010111000011011010111100110100011111001011011101011111000101001001111100010100001011010010011100101111011011000010011101011101010101000010011101010111011011011101011101111101001011011100111101010101110011000010111101011100000111011010111100101111011011001011011101100100000011000101101000010 efa5b9eab985eae1b5e68f96ebe293e285a4e5ed84ebaa84eaedbaefa5b9eab985eb83b5e5ed96ec818b42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
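
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is how the ISO-8859-1 row above appears to behave; results for some charsets may still differ from what this page prints.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("畑B").items():
        print(label, bits, hex_string)
```

For example, `encode_row("畑", "utf-8")` yields the hex string `e79591`, which matches the first three bytes of the UTF-8 row above.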