To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??爰?????榮?│?〓╋余??? 1110001110100000001111110011111111100000101001110011111100111111001111110011111100111111100111101100010000111111100001001010000000111111100000011010110010000100101101001001011101011101001111110011111100111111 e3a03f3fe0a73f3f3f3f3f9ec43f84a03f81ac84b4975d3f3f3f
EUC-JP 罌??爰?????榮?│?〓╋余??彛 11100110101000100011111100111111111000001010100100111111001111110011111100111111001111111101110011000110001111111010100010100010001111111010001010101110101010001011011011001101101111100011111100111111100011111011110011111010 e6a23f3fe0a93f3f3f3f3fdcc63fa8a23fa2aea8b6cdbe3f3f8fbcfa
UTF-8 罌븐꼨爰덆턁琉븍꽰榮싷│類〓╋余쒕ㅉ彛 111001111011110110001100111010111011100010010000111010101011110010101000111001111000100010110000111010111000110110000110111011011000010010000001111011111010011110001100111010111011100010001101111010101011110110110000111001101010011010101110111011001000101110110111111000101001010010000010111011111010011110010000111000111000000010010011111000101001010110001011111001001011110110011001111011001001001010010101111000111000010110001001111001011011110110011011 e7bd8cebb890eabca8e788b0eb8d86ed8481efa78cebb88deabdb0e6a6aeec8bb7e29482efa790e38093e2958be4bd99ec9295e38589e5bd9b
UHC 罌븐꼨爰덆턁琉븍꽰榮싷│類〓╋余쒕ㅉ彛 1110010110100010101110101110110010000100100001011110101010111010100010001110100110110101100111011110101110100100101110101110101110000100101110111110011110110100100110101110111110100110101000101110101110111010101000011110101110100110101101101110010111111001100111001110101110100100101110011110110010101101 e5a2baec8485eaba88e9b59deba4baeb84bbe7b49aefa6a2ebbaa1eba6b6e5f99ceba4b9ecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)