To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 畑??乳??淫??瑤??悠?Ⅹ源???k?B 100101001010100000111111001111111001001111111011001111110011111110001000111110100011111100111111111010101010001000111111001111111001011101001001001111111000011101011101100011001011100100111111001111110011111110000010100010110011111101000010 94a83f3f93fb3f3f88fa3f3feaa23f3f97493f875d8cb93f3f3f828b3f42
EUC-JP 畑??乳??淫??瑤??悠??源???k?B 1100100010101010001111110011111111000110111111010011111100111111101100001111110000111111001111111111010010100100001111110011111111001101101010100011111100111111101110001011101100111111001111110011111110100011111010110011111101000010 c8aa3f3fc6fd3f3fb0fc3f3ff4a43f3fcdaa3f3fb8bb3f3f3fa3eb3f42
UTF-8 畑밴퉭乳득룚淫뉗쯾瑤녹럩悠뱄Ⅹ源낅꺏力k돽B 11100111100101011001000111101011101100001011010011101101100010011010110111100100101110011011001111101011100100111001110111101011101000111001101011100110101101111010101111101011100010011001011111101100101011111011111011100111100100011010010011101011100001011011100111101011100111111010100111100110100000101010000011101011101100011000010011100010100001011010100111100110101110101001000011101011100000101000010111101010101110101000111111101111101001101000101011101111101111011000101111101011100011111011110101000010 e79591ebb0b4ed89ade4b9b3eb939deba39ae6b7abeb8997ecafbee791a4eb85b9eb9fa9e682a0ebb184e285a9e6ba90eb8285eaba8fefa68aefbd8beb8fbd42
UHC 畑밴퉭乳득룚淫뉗쯾瑤녹럩悠뱄Ⅹ源낅꺏力k돽B 11101111101001011011100111101010101110011000010111101010111000011011010111100110100011111001011011101011111000101000011111101100101010011000001011101000111111011011001111101100100011101000110011101010111011011011100111101111101001011011100111101010101110011000010111101011100000111011010111100110101100111010001111101011100010011011111101000010 efa5b9eab985eae1b5e68f96ebe287eca982e8fdb3ec8e8ceaedb9efa5b9eab985eb83b5e6b3a3eb89bf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)