To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゅ???ぐ娃??塋ゅ???ぐ娃??塋ゅ? 10011010110010001000001011100011001111110011111100111111100000101010111010001000101000010011111100111111100110101100100010000010111000110011111100111111001111111000001010101110100010001010000100111111001111111001101011001000100000101110001100111111 9ac882e33f3f3f82ae88a13f3f9ac882e33f3f3f82ae88a13f3f9ac882e33f
EUC-JP 塋ゅ???ぐ娃??塋ゅ?孼?ぐ娃??塋ゅ? 110101001100101010100100111001010011111100111111001111111010010010110000101100001010001100111111001111111101010011001010101001001110010100111111100011111011101011000011001111111010010010110000101100001010001100111111001111111101010011001010101001001110010100111111 d4caa4e53f3f3fa4b0b0a33f3fd4caa4e53f8fbac33fa4b0b0a33f3fd4caa4e53f
UTF-8 塋ゅ춼掠욆ぐ娃쒎춼塋ゅ뜵孼껇ぐ娃쒏릍塋ゅ춼 111001011010000110001011111000111000001010000101111011001011011010111100111011111010010110110101111011001001101010000110111000111000000110010000111001011010100010000011111011001001001010001110111011001011011010111100111001011010000110001011111000111000001010000101111010111001110010110101111001011010110110111100111010101011101110000111111000111000000110010000111001011010100010000011111011001001001010001111111010111010011010001101111001011010000110001011111000111000001010000101111011001011011010111100 e5a18be38285ecb6bcefa5b5ec9a86e38190e5a883ec928eecb6bce5a18be38285eb9cb5e5adbceabb87e38190e5a883ec928feba68de5a18be38285ecb6bc
UHC 塋ゅ춼掠욆ぐ娃쒎춼塋ゅ뜵孼껇ぐ娃쒏릍塋ゅ춼 111001111010101110101010111001011010110110011000111001011011000110011110111010001010101010110000111010001101111110011100111001011010110110011000111001111010101110101010111001011000110110110011111001011110110110000011111010001010101010110000111010001101111110011100111001101011100010101100111001111010101110101010111001011010110110011000 e7abaae5ad98e5b19ee8aab0e8df9ce5ad98e7abaae58db3e5ed83e8aab0e8df9ce6b8ace7abaae5ad98

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)