To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 竪捉村谷続則谷損村竪捉村谷続則谷損孫^ 10010010010001111001000110101000100100011011101010010010010010101001000110110001100100011010010110010010010010101001000110111001100100011011101010010010010001111001000110101000100100011011101010010010010010101001000110110001100100011010010110010010010010101001000110111001100100011011011101011110 924791a891ba924a91b191a5924a91b991ba924791a891ba924a91b191a5924a91b991b75e
EUC-JP 竪捉村谷続則谷損村竪捉村谷続則谷損孫^ 11000011101010001100001010101010110000101011110011000011101010111100001010110011110000101010011111000011101010111100001010111011110000101011110011000011101010001100001010101010110000101011110011000011101010111100001010110011110000101010011111000011101010111100001010111011110000101011100101011110 c3a8c2aac2bcc3abc2b3c2a7c3abc2bbc2bcc3a8c2aac2bcc3abc2b3c2a7c3abc2bbc2b95e
UTF-8 竪捉村谷続則谷損村竪捉村谷続則谷損孫^ 11100111101010111010101011100110100011011000100111100110100111011001000111101000101100001011011111100111101101101001101011100101100010011000011111101000101100001011011111100110100100001000110111100110100111011001000111100111101010111010101011100110100011011000100111100110100111011001000111101000101100001011011111100111101101101001101011100101100010011000011111101000101100001011011111100110100100001000110111100101101011011010101101011110 e7abaae68d89e69d91e8b0b7e7b69ae58987e8b0b7e6908de69d91e7abaae68d89e69d91e8b0b7e7b69ae58987e8b0b7e6908de5adab5e
UHC 竪捉村谷?則谷損村竪捉村谷?則谷損孫^ 1110001010110101111100111011010111110101101111011100110111011011001111111111011011001110110011011101101111100001110111111111010110111101111000101011010111110011101101011111010110111101110011011101101100111111111101101100111011001101110110111110000111011111111000011101110101011110 e2b5f3b5f5bdcddb3ff6cecddbe1dff5bde2b5f3b5f5bdcddb3ff6cecddbe1dfe1dd5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)