To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 撓???藥??? 10011101100110100011111100111111001111111110010101011010001111110011111100111111 9d9a3f3f3fe55a3f3f3f
EUC-JP 撓???藥??彛 110110011111101000111111001111110011111111101001101110110011111100111111100011111011110011111010 d9fa3f3f3fe9bb3f3f8fbcfa
UTF-8 撓눸노걦藥뀁눦彛 111001101001001010010011111010111000100010111000111010111000010110111000111010101011000110100110111010001001011110100101111010111000000010000001111010111000100010100110111001011011110110011011 e69293eb88b8eb85b8eab1a6e897a5eb8081eb88a6e5bd9b
UHC 撓눸노걦藥뀁눦彛 11101000111101011000011111001110101100111110101110000001100011111110010110110111101100101110110010000111101111011110110010101101 e8f587ceb3eb818fe5b7b2ec87bdecad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)