To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????b[?????????b[^ 0011111100111111001111110011111100111111001111110011111100111111001111110110001001011011001111110011111100111111001111110011111100111111001111110011111100111111011000100101101101011110 3f3f3f3f3f3f3f3f3f625b3f3f3f3f3f3f3f3f3f625b5e
SJIS-WIN 塋ゅ?熬?ぐ???b[塋ゅ?熬?ぐ???b[^ 10011010110010001000001011100011001111111110000010010010001111111000001010101110001111110011111100111111011000100101101110011010110010001000001011100011001111111110000010010010001111111000001010101110001111110011111100111111011000100101101101011110 9ac882e33fe0923f82ae3f3f3f625b9ac882e33fe0923f82ae3f3f3f625b5e
EUC-JP 塋ゅ?熬?ぐ???b[塋ゅ?熬?ぐ???b[^ 11010100110010101010010011100101001111111101111111110010001111111010010010110000001111110011111100111111011000100101101111010100110010101010010011100101001111111101111111110010001111111010010010110000001111110011111100111111011000100101101101011110 d4caa4e53fdff23fa4b03f3f3f625bd4caa4e53fdff23fa4b03f3f3f625b5e
UTF-8 塋ゅ춼熬뽬ぐ筽삣뜏b[塋ゅ춼熬뽬ぐ筽삣뜏b[^ 1110010110100001100010111110001110000010100001011110110010110110101111001110011110000110101011001110101110111101101011001110001110000001100100001110011110101101101111011110110010000010101000111110101110011100100011110110001001011011111001011010000110001011111000111000001010000101111011001011011010111100111001111000011010101100111010111011110110101100111000111000000110010000111001111010110110111101111011001000001010100011111010111001110010001111011000100101101101011110 e5a18be38285ecb6bce786acebbdace38190e7adbdec82a3eb9c8f625be5a18be38285ecb6bce786acebbdace38190e7adbdec82a3eb9c8f625b5e
UHC 塋ゅ춼熬뽬ぐ筽삣뜏b[塋ゅ춼熬뽬ぐ筽삣뜏b[^ 1110011110101011101010101110010110101101100110001110100010100010100101101110100010101010101100001110100010100100101110111110010110001101100100100110001001011011111001111010101110101010111001011010110110011000111010001010001010010110111010001010101010110000111010001010010010111011111001011000110110010010011000100101101101011110 e7abaae5ad98e8a296e8aab0e8a4bbe58d92625be7abaae5ad98e8a296e8aab0e8a4bbe58d92625b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)