To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???嚥ラ?晤??訝 0011111100111111001111111001101010001011100000111000100100111111100111011110101100111111001111111110011001100010 3f3f3f9a8b83893f9deb3f3fe662
EUC-JP ???嚥ラ?晤??訝 0011111100111111001111111101001111101011101001011110100100111111110110101110110100111111001111111110101111000011 3f3f3fd3eba5e93fdaed3f3febc3
UTF-8 曆쇗쮵嚥ラ웺晤됭쾮訝 111011111010011010001011111011001000011110010111111011001010111010110101111001011001101010100101111000111000001110101001111011001001101110111010111001101001100110100100111010111001000010101101111011001011111010101110111010001010100010011101 efa68bec8797ecaeb5e59aa5e383a9ec9bbae699a4eb90adecbeaee8a89d
UHC 曆쇗쮵嚥ラ웺晤됭쾮訝 1110011010110111101111001110011010101000100100101110011010111111101010111110100110011111100001101110011111111011100010011110100010110010100001011110010010111000 e6b7bce6a892e6bfabe99f86e7fb89e8b285e4b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)