To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????D^R?????D^o?????D^]^ 00111111001111110011111100111111001111110100010001011110010100100011111100111111001111110011111100111111010001000101111001101111001111110011111100111111001111110011111101000100010111100101110101011110 3f3f3f3f3f445e523f3f3f3f3f445e6f3f3f3f3f3f445e5d5e
SJIS-WIN 﨑乞ぐ妺ィD^R﨑乞ぐ妺ィD^o﨑乞ぐ妺ィD^]^ 11111010101100011000110011101110100000101010111011111010101001011010100001000100010111100101001011111010101100011000110011101110100000101010111011111010101001011010100001000100010111100110111111111010101100011000110011101110100000101010111011111010101001011010100001000100010111100101110101011110 fab18cee82aefaa5a8445e52fab18cee82aefaa5a8445e6ffab18cee82aefaa5a8445e5d5e
EUC-JP ?乞ぐ妺ィD^R?乞ぐ妺ィD^o?乞ぐ妺ィD^]^ 00111111101110001111000010100100101100001000111110111001101101111000111010101000010001000101111001010010001111111011100011110000101001001011000010001111101110011011011110001110101010000100010001011110011011110011111110111000111100001010010010110000100011111011100110110111100011101010100001000100010111100101110101011110 3fb8f0a4b08fb9b78ea8445e523fb8f0a4b08fb9b78ea8445e6f3fb8f0a4b08fb9b78ea8445e5d5e
UTF-8 﨑乞ぐ妺ィD^R﨑乞ぐ妺ィD^o﨑乞ぐ妺ィD^]^ 11101111101010001001000111100100101110011001111011100011100000011001000011100101101001101011101011101111101111011010100001000100010111100101001011101111101010001001000111100100101110011001111011100011100000011001000011100101101001101011101011101111101111011010100001000100010111100110111111101111101010001001000111100100101110011001111011100011100000011001000011100101101001101011101011101111101111011010100001000100010111100101110101011110 efa891e4b99ee38190e5a6baefbda8445e52efa891e4b99ee38190e5a6baefbda8445e6fefa891e4b99ee38190e5a6baefbda8445e5d5e
UHC ?乞ぐ??D^R?乞ぐ??D^o?乞ぐ??D^]^ 00111111110010111111011110101010101100000011111100111111010001000101111001010010001111111100101111110111101010101011000000111111001111110100010001011110011011110011111111001011111101111010101010110000001111110011111101000100010111100101110101011110 3fcbf7aab03f3f445e523fcbf7aab03f3f445e6f3fcbf7aab03f3f445e5d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)