What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 乳?d乳??哀?■旬?乳?d乳??哀?■旬?^ 1001001111111011001111111000001010000100100100111111101100111111001111111000100010100011001111111000000110100001100011110111101100111111100100111111101100111111100000101000010010010011111110110011111100111111100010001010001100111111100000011010000110001111011110110011111101011110 93fb3f828493fb3f3f88a33f81a18f7b3f93fb3f828493fb3f3f88a33f81a18f7b3f5e
EUC-JP 乳?d乳??哀?■旬?乳?d乳??哀?■旬?^ 1100011011111101001111111010001111100100110001101111110100111111001111111011000010100101001111111010001010100011101111011101110000111111110001101111110100111111101000111110010011000110111111010011111100111111101100001010010100111111101000101010001110111101110111000011111101011110 c6fd3fa3e4c6fd3f3fb0a53fa2a3bddc3fc6fd3fa3e4c6fd3f3fb0a53fa2a3bddc3f5e
UTF-8 乳㏘d乳㏘쨩哀얕■旬㏏乳㏘d乳㏘쨩哀얕■旬㏏^ 11100100101110011011001111100011100011111001100011101111101111011000010011100100101110011011001111100011100011111001100011101100101010001010100111100101100100111000000011101100100101101001010111100010100101101010000011100110100101111010110011100011100011111000111111100100101110011011001111100011100011111001100011101111101111011000010011100100101110011011001111100011100011111001100011101100101010001010100111100101100100111000000011101100100101101001010111100010100101101010000011100110100101111010110011100011100011111000111101011110 e4b9b3e38f98efbd84e4b9b3e38f98eca8a9e59380ec9695e296a0e697ace38f8fe4b9b3e38f98efbd84e4b9b3e38f98eca8a9e59380ec9695e296a0e697ace38f8f5e
UHC 乳㏘d乳㏘쨩哀얕■旬㏏乳㏘d乳㏘쨩哀얕■旬㏏^ 111010101110000110100010111001001010001111100100111010101110000110100010111001001100001010111011111001001110111010111110111010001010000111100001111000101110001010100111101110011110101011100001101000101110010010100011111001001110101011100001101000101110010011000010101110111110010011101110101111101110100010100001111000011110001011100010101001111011100101011110 eae1a2e4a3e4eae1a2e4c2bbe4eebee8a1e1e2e2a7b9eae1a2e4a3e4eae1a2e4c2bbe4eebee8a1e1e2e2a7b95e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
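
The rows of the table above can be approximated offline with a short Python sketch. The codec names below are assumptions about how the page's charset labels map onto Python's standard codecs (SJIS-WIN to cp932, UHC to cp949), and the helper name to_bit_strings is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes shown in the table, though the page's own conversion may differ in edge cases.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters become "?" (byte 0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Sample input: U+4E73 (the first character in the table) followed by "^".
sample = "\u4e73^"
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings(sample, codec)
    print(label, binary, hexadecimal)
```

Each output line carries the charset label, the binary bit string, and the hexadecimal bit string, in the same column order as the table (without the Character column).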