To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??こ?⇒莎ア?郡??こ?⇒莎ア?裙^ 0011111100111111100000101011000100111111100000011100101111100100101100111000001101000001001111111000110001010011001111110011111110000010101100010011111110000001110010111110010010110011100000110100000100111111111001011110001101011110 3f3f82b13f81cbe4b383413f8c533f3f82b13f81cbe4b383413fe5e35e
EUC-JP ??こ?⇒莎ア?郡??こ?⇒莎ア?裙^ 0011111100111111101001001011001100111111101000101100110111101000101101011010010110100010001111111011011110110100001111110011111110100100101100110011111110100010110011011110100010110101101001011010001000111111111010101110010101011110 3f3fa4b33fa2cde8b5a5a23fb7b43f3fa4b33fa2cde8b5a5a23feae55e
UTF-8 룶쥚こ룶⇒莎ア룵郡룶쥚こ룶⇒莎ア룵裙^ 11101011101000111011011011101100101001011001101011100011100000011001001111101011101000111011011011100010100001111001001011101000100011101000111011100011100000101010001011101011101000111011010111101001100000111010000111101011101000111011011011101100101001011001101011100011100000011001001111101011101000111011011011100010100001111001001011101000100011101000111011100011100000101010001011101011101000111011010111101000101000111001100101011110 eba3b6eca59ae38193eba3b6e28792e88e8ee382a2eba3b5e983a1eba3b6eca59ae38193eba3b6e28792e88e8ee382a2eba3b5e8a3995e
UHC 룶쥚こ룶⇒莎ア룵郡룶쥚こ룶⇒莎ア룵裙^ 10001111101010111010001010001111101010101011001110001111101010111010001010100001110111101110110110101011101000101000111110101010110011111101101110001111101010111010001010001111101010101011001110001111101010111010001010100001110111101110110110101011101000101000111110101010110011111101100101011110 8faba28faab38faba2a1deedaba28faacfdb8faba28faab38faba2a1deedaba28faacfd95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)