What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????W}?????????W{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 該?????杖??W}該?????杖??W{^ 100010100101100100111111001111110011111100111111001111111000111111110001001111110011111101010111011111011000101001011001001111110011111100111111001111110011111110001111111100010011111100111111010101110111101101011110 8a593f3f3f3f3f8ff13f3f577d8a593f3f3f3f3f8ff13f3f577b5e
EUC-JP 該??琰??杖??W}該??琰??杖??W{^ 10110011101110100011111100111111100011111100110010110100001111110011111110111110111100110011111100111111010101110111110110110011101110100011111100111111100011111100110010110100001111110011111110111110111100110011111100111111010101110111101101011110 b3ba3f3f8fccb43f3fbef33f3f577db3ba3f3f8fccb43f3fbef33f3f577b5e
UTF-8 該뚲궗琰랑쾹杖⅛옫W}該뚲궗琰랑쾹杖⅛옫W{^ 1110100010101001101100101110101110011010101100101110101010110110100101111110011110010000101100001110101110011110100100011110110010111110101110011110011010011101100101101110001010000101100110111110110010011000101010110101011101111101111010001010100110110010111010111001101010110010111010101011011010010111111001111001000010110000111010111001111010010001111011001011111010111001111001101001110110010110111000101000010110011011111011001001100010101011010101110111101101011110 e8a9b2eb9ab2eab697e790b0eb9e91ecbeb9e69d96e2859bec98ab577de8a9b2eb9ab2eab697e790b0eb9e91ecbeb9e69d96e2859bec98ab577b5e
UHC 該뚲궗琰랑쾹杖⅛옫W}該뚲궗琰랑쾹杖⅛옫W{^ 1111101010110001100011001110111010000010101011001110011011111100101101101111101110110010100011111110110111101000101010001111101110011110101010100101011101111101111110101011000110001100111011101000001010101100111001101111110010110110111110111011001010001111111011011110100010101000111110111001111010101010010101110111101101011110 fab18cee82ace6fcb6fbb28fede8a8fb9eaa577dfab18cee82ace6fcb6fbb28fede8a8fb9eaa577b5e

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
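
The conversion shown in the table above can be approximated with a short Python sketch. This is not the page's actual implementation: the helper name `encode_table` and the `CHARSETS` mapping are hypothetical, and it assumes that Python's `cp932` and `cp949` codecs correspond to the SJIS-win and UHC rows (individual mappings may differ from the tool's). Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        # Decode again to show what survived the round trip.
        shown = data.decode(codec, errors="replace")
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("該W}"):
        print(label, shown, bits, hex_string)
```

Each output row follows the table's column order: charset, the character as it survives the conversion, the binary bit string, and the hexadecimal bit string.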