Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 薰「薰「薰サ迢「 | 111110111001111010100010111110111001111010100010111110111001111010111011111001111000101110100010 | fb9ea2fb9ea2fb9ebbe78ba2 |
| EUC-JP | ?「?「?サ迢「 | 00111111100011101010001000111111100011101010001000111111100011101011101111101101111010111000111010100010 | 3f8ea23f8ea23f8ebbedeb8ea2 |
| UTF-8 | 薰「薰「薰サ迢「 | 111010001001011010110000111011111011110110100010111010001001011010110000111011111011110110100010111010001001011010110000111011111011110110111011111010001011111110100010111011111011110110100010 | e896b0efbda2e896b0efbda2e896b0efbdbbe8bfa2efbda2 |
| UHC | 薰?薰?薰??? | 1111110110111001001111111111110110111001001111111111110110111001001111110011111100111111 | fdb93ffdb93ffdb93f3f3f |

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
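
A conversion of this kind can be approximated in a few lines of Python. The sketch below is not the code behind this page. The function names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is the behavior the results above suggest.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("薰").items():
        print(label, bits, hex_string)
```

Python's codec tables may differ from those used by the page for some characters, so the output is not guaranteed to match the results above byte for byte.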