To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???議??邑??筌??誼??袁⑦????^ 00111111001111110011111110001011011000110011111100111111100101110101011100111111001111111110001010100011001111110011111110001011011000100011111100111111111001011100110110000111010001100011111100111111001111110011111101011110 3f3f3f8b633f3f97573f3fe2a33f3f8b623f3fe5cd87463f3f3f3f5e
EUC-JP ???議??邑??筌??誼??袁?????^ 001111110011111100111111101101011100010000111111001111111100110110111000001111110011111111100100101001010011111100111111101101011100001100111111001111111110101011001111001111110011111100111111001111110011111101011110 3f3f3fb5c43f3fcdb83f3fe4a53f3fb5c33f3feacf3f3f3f3f3f5e
UTF-8 囹덈슢議곩▶邑꿔걠筌뗪퀡誼드짃袁⑦닑嶺띿램^ 11101111101001101010100111101011100011011000100011101100100010101010001011101000101011011011000011101010101100111010100111100010100101101011011011101001100000101001000111101010101111111001010011101010101100011010000011100111101011011000110011101011100101111010101011101101100000001010000111101000101010101011110011101011100100111001110011101100101001111000001111101000101000101000000111100010100100011010011011101011100010111001000111101111101001101010101111101011100111011011111111101011100111101010100001011110 efa6a9eb8d88ec8aa2e8adb0eab3a9e296b6e98291eabf94eab1a0e7ad8ceb97aaed80a1e8aabceb939ceca783e8a281e291a6eb8b91efa6abeb9dbfeb9ea85e
UHC 囹덈슢議곩▶邑꿔걠筌뗪퀡誼드짃袁⑦닑嶺띿램^ 11100111101010101000100011101011100110101010111011101100101000011000000111100101101000101011101011101011111010011011001011100011100000011000100111101111101001111000101111101010101100111001010111101011111111101011010111100101101000111001001111101010101111101010100011101101100010001001011011100111101011011000110111101100101101111010010101011110 e7aa88eb9aaeeca181e5a2baebe9b2e38189efa78beab395ebfeb5e5a393eabea8ed8896e7ad8decb7a55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)