To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 題?翁??企慮爾?題?翁??企慮爾?^ 1001000111101000001111111000100110100101001111110011111110001010111010011001011110110110100011101010001000111111100100011110100000111111100010011010010100111111001111111000101011101001100101111011011010001110101000100011111101011110 91e83f89a53f3f8ae997b68ea23f91e83f89a53f3f8ae997b68ea23f5e
EUC-JP 題?翁??企慮爾?題?翁??企慮爾?^ 1100001011101010001111111011001010100111001111110011111110110100111010111100111010111000101111001010010000111111110000101110101000111111101100101010011100111111001111111011010011101011110011101011100010111100101001000011111101011110 c2ea3fb2a73f3fb4ebceb8bca43fc2ea3fb2a73f3fb4ebceb8bca43f5e
UTF-8 題렲翁댓렱企慮爾왼題렲翁댓렱企慮爾외^ 11101001101000011000110011101011101000001011001011100111101111111000000111101011100011001001001111101011101000001011000111100100101111001000000111100110100001011010111011100111100010001011111011101100100110011011110011101001101000011000110011101011101000001011001011100111101111111000000111101011100011001001001111101011101000001011000111100100101111001000000111100110100001011010111011100111100010001011111011101100100110011011100001011110 e9a18ceba0b2e7bf81eb8c93eba0b1e4bc81e685aee788beec99bce9a18ceba0b2e7bf81eb8c93eba0b1e4bc81e685aee788beec99b85e
UHC 題렲翁댓렱企慮爾왼題렲翁댓렱企慮爾외^ 11110000101110011000111010111111111010001011101010110100111100011000111010111110110100001110101011010101111001111110110010110011101111111101111011110000101110011000111010111111111010001011101010110100111100011000111010111110110100001110101011010101111001111110110010110011101111111101110001011110 f0b98ebfe8bab4f18ebed0ead5e7ecb3bfdef0b98ebfe8bab4f18ebed0ead5e7ecb3bfdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)