To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖??悠?ぜ乙l?汚??壹??飮??孃 10011111010100000011111100111111100101110100100100111111100000101011101010001001101100111000001010001100001111111000100110011000001111110011111110011010111000110011111100111111100111110101101000111111001111111001101101101111 9f503f3f97493f82ba89b3828c3f89983f3f9ae33f3f9f5a3f3f9b6f
EUC-JP 蘖??悠?ぜ乙l?汚??壹??飮??孃 11011101101100010011111100111111110011011010101000111111101001001011110010110010101101011010001111101100001111111011000111111000001111110011111111010100111001010011111100111111110111011011101100111111001111111101010111010000 ddb13f3fcdaa3fa4bcb2b5a3ec3fb1f83f3fd4e53f3fddbb3f3fd5d0
UTF-8 蘖뽰궠悠껇ぜ乙l뒙汚삳쪋壹썲넼飮뉖쎗孃 111010001001100010010110111010111011110110110000111010101011011010100000111001101000001010100000111010101011101110000111111000111000000110011100111001001011100110011001111011111011110110001100111010111001001010011001111001101011000110011010111011001000001010110011111011001010101010001011111001011010001110111001111011001000110110110010111010111000010010111100111010011010001110101110111010111000100110010110111011001000111010010111111001011010110110000011 e89896ebbdb0eab6a0e682a0eabb87e3819ce4b999efbd8ceb9299e6b19aec82b3ecaa8be5a3b9ec8db2eb84bce9a3aeeb8996ec8e97e5ad83
UHC 蘖뽰궠悠껇ぜ乙l뒙汚삳쪋壹썲넼飮뉖쎗孃 1110010111101110100101101110110010000010101100111110101011101101100000111110100010101010101111001110101111100000101000111110110010001010100101101110011111111101101110111110101110100101100001011110110011101100101111011110010110000110101101101110101111100110100001111110101110011011101111101110010110111110 e5ee96ec82b3eaed83e8aabcebe0a3ec8a96e7fdbbeba585ececbde586b6ebe687eb9bbee5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)