To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???爾e?儀??玉??揖ε?遺??畏 001111110011111100111111100011101010001010000010100001010011111110001011010101100011111100111111100010111100101000111111001111111001011101001011100000111100001100111111100010001110001000111111001111111000100011011000 3f3f3f8ea282853f8b563f3f8bca3f3f974b83c33f88e23f3f88d8
EUC-JP 艅??爾e?儀??玉??揖ε?遺??畏 1000111111010110111111010011111100111111101111001010010010100011111001010011111110110101101101110011111100111111101101101100110000111111001111111100110110101100101001101100010100111111101100001110010000111111001111111011000011011010 8fd6fd3f3fbca4a3e53fb5b73f3fb6cc3f3fcdaca6c53fb0e43f3fb0da
UTF-8 艅덇쐼爾e립儀숉뫝玉좎눦揖ε뿗遺쇰렱畏 1110100010001001100001011110101110001101100001111110110010010000101111001110011110001000101111101110111110111101100001011110101110100110101111011110010110000100100000001110110010001000100010011110101110101011100111011110011110001110100010011110110010100010100011101110101110001000101001101110011010001111100101101100111010110101111010111011111110010111111010011000000110111010111011001000011110110000111010111010000010110001111001111001010110001111 e88985eb8d87ec90bce788beefbd85eba6bde58480ec8889ebab9de78e89eca28eeb88a6e68f96ceb5ebbf97e981baec87b0eba0b1e7958f
UHC 艅덇쐼爾e립儀숉뫝玉좎눦揖ε뿗遺쇰렱畏 1110011010101001100010001110101010111110101000101110110010110011101000111110010110111000101100111110101111110000100110011110110110010001101111011110100010101100101000001110110010000111101111011110101111100111101001011110010110010111100110101110101110110110101111001110101110001110101111101110100011100110 e6a988eabea2ecb3a3e5b8b3ebf099ed91bde8aca0ec87bdebe7a5e5979aebb6bceb8ebee8e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)