To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???鵝??爺や?歟③?????εИ鵝??^ 001111110011111100111111111010100100000000111111001111111001011011101010100000101110001000111111100111110110001010000111010000100011111100111111001111110011111100111111100000111100001110000100010010011110101001000000001111110011111101011110 3f3f3fea403f3f96ea82e23f9f6287423f3f3f3f3f83c38449ea403f3f5e
EUC-JP ???鵝??爺や?歟?????墉εИ鵝??^ 00111111001111110011111111110011101000010011111100111111110011001110110010100100111001000011111111011101110000110011111100111111001111110011111100111111100011111011100010111111101001101100010110100111101010101111001110100001001111110011111101011110 3f3f3ff3a13f3fcceca4e43fddc33f3f3f3f3f8fb8bfa6c5a7aaf3a13f3f5e
UTF-8 燎쒕젛鵝롩벤爺や퐥歟③씄溜쀤퐥墉εИ鵝롦궒^ 1110111110100111100000001110110010010010100101011110110010100000100110111110100110110101100111011110101110100001101010011110101110110010101001001110011110001000101110101110001110000010100001001110110110010000101001011110011010101101100111111110001010010001101000101110110010010100100001001110111110100111100010111110110010000000101001001110110110010000101001011110010110100010100010011100111010110101110100001001100011101001101101011001110111101011101000011010011011101010101101101001001001011110 efa780ec9295eca09be9b59deba1a9ebb2a4e788bae38284ed90a5e6ad9fe291a2ec9484efa78bec80a4ed90a5e5a289ceb5d098e9b59deba1a6eab6925e
UHC 燎쒕젛鵝롩벤爺や퐥歟③씄溜쀤퐥墉εИ鵝롦궒^ 11101000111110111001110011101011101000001001011111100100101111011000111011101001101110101010010111100101101011001010101011100100101111011000111011100110101000101010100011101001100111011001110011101010111111101001011111100100101111011000111011101001101110101010010111100101101011001010101011100100101111011000111011100110100000101010011101011110 e8fb9ceba097e4bd8ee9baa5e5acaae4bd8ee6a2a8e99d9ceafe97e4bd8ee9baa5e5acaae4bd8ee682a75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)