To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN ???夷??淹??D???夷??淹??D^ 00111111001111110011111110001000110011100011111100111111100111111011100100111111001111110100010000111111001111110011111110001000110011100011111100111111100111111011100100111111001111110100010001011110 3f3f3f88ce3f3f9fb93f3f443f3f3f88ce3f3f9fb93f3f445e
EUC-JP 沅??夷??淹??D沅??夷??淹??D^ 1000111111000110111010010011111100111111101100001101000000111111001111111101111010111011001111110011111101000100100011111100011011101001001111110011111110110000110100000011111100111111110111101011101100111111001111110100010001011110 8fc6e93f3fb0d03f3fdebb3f3f448fc6e93f3fb0d03f3fdebb3f3f445e
UTF-8 沅룹젿夷섏콛淹쒖쮰D沅룹젿夷섏콛淹쒖쮰D^ 111001101011001010000101111010111010001110111001111011001010000010111111111001011010010010110111111011001000010010001111111011001011110110011011111001101011011110111001111011001001001010010110111011001010111010110000010001001110011010110010100001011110101110100011101110011110110010100000101111111110010110100100101101111110110010000100100011111110110010111101100110111110011010110111101110011110110010010010100101101110110010101110101100000100010001011110 e6b285eba3b9eca0bfe5a4b7ec848fecbd9be6b7b9ec9296ecaeb044e6b285eba3b9eca0bfe5a4b7ec848fecbd9be6b7b9ec9296ecaeb0445e
UHC 沅룹젿夷섏콛淹쒖쮰D沅룹젿夷섏콛淹쒖쮰D^ 111010101011011010110111111011001010000010110001111011001010100010011000111011001011000110010100111001011111010010011100111011001010100010001101010001001110101010110110101101111110110010100000101100011110110010101000100110001110110010110001100101001110010111110100100111001110110010101000100011010100010001011110 eab6b7eca0b1eca898ecb194e5f49ceca88d44eab6b7eca0b1eca898ecb194e5f49ceca88d445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)