To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 撓ζ?貢汚??撓?????節??要?? 1001110110011010100000111100010000111111100011010111011010001001100110000011111100111111100111011001101000111111001111110011111100111111001111111001000011011111001111110011111110010111011101100011111100111111 9d9a83c43f8d7689983f3f9d9a3f3f3f3f3f90df3f3f97763f3f
EUC-JP 撓ζ?貢汚??撓?????節??要?? 1101100111111010101001101100011000111111101110011101011110110001111110000011111100111111110110011111101000111111001111110011111100111111001111111100000011100001001111110011111111001101110101110011111100111111 d9faa6c63fb9d7b1f83f3fd9fa3f3f3f3f3fc0e13f3fcdd73f3f
UTF-8 撓ζ틮貢汚뗰쉈撓뚳쉥遼㏛벤節곤쉽要쏉숴 1110011010010010100100111100111010110110111011011000101110101110111010001011001010100010111001101011000110011010111010111001011110110000111011001000100110001000111001101001001010010011111010111001101010110011111011001000100110100101111011111010011110000011111000111000111110011011111010111011001010100100111001111010111110000000111010101011001110100100111011001000100110111101111010001010011010000001111011001000111110001001111011001000100010110100 e69293ceb6ed8baee8b2a2e6b19aeb97b0ec8988e69293eb9ab3ec89a5efa783e38f9bebb2a4e7af80eab3a4ec89bde8a681ec8f89ec88b4
UHC 撓ζ틮貢汚뗰쉈撓뚳쉥遼㏛벤節곤쉽要쏉숴 1110100011110101101001011110011010111010100110001100110111111000111001111111110110001011111011111011110110100101111010001111010110001100111011111011110110101011111010011010110010100111111001001011101010100101111011111011110110110000111011111011110110110001111010011010100110011011111011111011110110100100 e8f5a5e6ba98cdf8e7fd8befbda5e8f58cefbdabe9aca7e4baa5efbdb0efbdb1e9a99befbda4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)