Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????????Щ湲?????碍?┰臾??^ 001111110011111100111111001111110011111100111111001111110011111110000100010110101001111111010001001111110011111100111111001111110011111110001010010101100011111110000100101110111110010001101011001111110011111101011110 3f3f3f3f3f3f3f3f845a9fd13f3f3f3f3f8a563f84bbe46b3f3f5e
EUC-JP ????????Щ湲?????碍?┰臾??^ 001111110011111100111111001111110011111100111111001111110011111110100111101110111101111011010011001111110011111100111111001111110011111110110011101101110011111110101000101111011110011111001100001111110011111101011110 3f3f3f3f3f3f3f3fa7bbded33f3f3f3f3fb3b73fa8bde7cc3f3f5e
UTF-8 若쒕젦琉뚦뼔利쎈Щ湲됧츦溜욌쨲碍쒖┰臾산툈^ 111011111010010110110100111011001001001010010101111011001010000010100110111011111010011110001100111010111001101010100110111010111011110010010100111011111010011110011101111011001000111010001000110100001010100111100110101110011011001011101011100100001010011111101100101110001010011011101111101001111000101111101100100110101000110011101100101010001011001011100111101000101000110111101100100100101001011011100010100101001011000011101000100001111011111011101100100000101011000011101101100010001000100001011110 efa5b4ec9295eca0a6efa78ceb9aa6ebbc94efa79dec8e88d0a9e6b9b2eb90a7ecb8a6efa78bec9a8ceca8b2e7a28dec9296e294b0e887beec82b0ed88885e
UHC 若쒕젦琉뚦뼔利쎈Щ湲됧츦溜욌쨲碍쒖┰臾산툈^ 11100101101011101001110011101011101000001001111011101011101001001000110011100101100101101001110011101100101001101011110111101011101011001011101111101010101110001000100111100101101011101001110011101010111111101001111011101011101001001000110011100100111101001001110011101100101001101011110111101011101011001011101111101010101110001000000101011110 e5ae9ceba09eeba48ce5969ceca6bdebacbbeab889e5ae9ceafe9eeba48ce4f49ceca6bdebacbbeab8815e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
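
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names are Python's own, and the mapping from the table's charset labels to them (in particular UHC to `cp949`) is an assumption. The helper name `to_bits` is hypothetical. Replacing unencodable characters with "?" (0x3f) is also an assumption, chosen because it matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def to_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" emits "?" (0x3f) for characters the charset cannot
    # represent; this is assumed to mirror the tool's behavior.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ^"  # one Japanese character followed by an ASCII caret
    for charset, codec in CODECS.items():
        binary, hexadecimal = to_bits(sample, codec)
        print(charset, binary, hexadecimal)
```

The ASCII caret encodes to `5e` in every charset listed, which is why each row of the table ends with the same byte. Characters outside a charset's repertoire collapse to `3f`, so the original text cannot be recovered from those rows.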