To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??乳??淫??押る?萸??蹂??悠? 10010100101010000011111100111111100100111111101100111111001111111000100011111010001111110011111110001001100111111000001011101001001111111110010011001110001111110011111111100110111110000011111100111111100101110100100100111111 94a83f3f93fb3f3f88fa3f3f899f82e93fe4ce3f3fe6f83f3f97493f
EUC-JP 畑??乳??淫??押る?萸??蹂??悠? 11001000101010100011111100111111110001101111110100111111001111111011000011111100001111110011111110110010101000011010010011101011001111111110100011010000001111110011111111101100111110100011111100111111110011011010101000111111 c8aa3f3fc6fd3f3fb0fc3f3fb2a1a4eb3fe8d03f3fecfa3f3fcdaa3f
UTF-8 畑밴퉭乳득룚淫뉗꽑押る굟萸썽뇹蹂껊씮悠튎 111001111001010110010001111010111011000010110100111011011000100110101101111001001011100110110011111010111001001110011101111010111010001110011010111001101011011110101011111010111000100110010111111010101011110110010001111001101000101010111100111000111000001010001011111010101011010110011111111010001001000010111000111011001000110110111101111010111000011110111001111010001011100110000010111010101011101110001010111011001001010010101110111001101000001010100000111011011000101010001110 e79591ebb0b4ed89ade4b9b3eb939deba39ae6b7abeb8997eabd91e68abce3828beab59fe890b8ec8dbdeb87b9e8b982eabb8aec94aee682a0ed8a8e
UHC 畑밴퉭乳득룚淫뉗꽑押る굟萸썽뇹蹂껊씮悠튎 11101111101001011011100111101010101110011000010111101010111000011011010111100110100011111001011011101011111000101000011111101100100001001010000011100100111000111010101011101011100000101000011111101011101011011011110111101001101101001010011011101011101100111000001111101011100111011011111111101010111011011011101001000010 efa5b9eab985eae1b5e68f96ebe287ec84a0e4e3aaeb8287ebadbde9b4a6ebb383eb9dbfeaedba42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)