To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???壯??????????壯???????B 00111111001111110011111110011010111000010011111100111111001111110011111100111111001111110011111100111111001111110011111110011010111000010011111100111111001111110011111100111111001111110011111101000010 3f3f3f9ae13f3f3f3f3f3f3f3f3f3f9ae13f3f3f3f3f3f3f42
EUC-JP ???壯??????????壯???????B 00111111001111110011111111010100111000110011111100111111001111110011111100111111001111110011111100111111001111110011111111010100111000110011111100111111001111110011111100111111001111110011111101000010 3f3f3fd4e33f3f3f3f3f3f3f3f3f3fd4e33f3f3f3f3f3f3f42
UTF-8 了몌푽壯낉슘廉뉛푵連푲了몌푽壯낉슘廉뉛푵連푲B 11101111101001101011101011101011101010101000110011101101100100011011110111100101101000111010111111101011100000101000100111101100100010101001100011101111101001101010001011101011100010011001101111101101100100011011010111101111101001101001101011101101100100011011001011101111101001101011101011101011101010101000110011101101100100011011110111100101101000111010111111101011100000101000100111101100100010101001100011101111101001101010001011101011100010011001101111101101100100011011010111101111101001101001101011101101100100011011001001000010 efa6baebaa8ced91bde5a3afeb8289ec8a98efa6a2eb899bed91b5efa69aed91b2efa6baebaa8ced91bde5a3afeb8289ec8a98efa6a2eb899bed91b5efa69aed91b242
UHC 了몌푽壯낉슘廉뉛푵連푲了몌푽壯낉슘廉뉛푵連푲B 111010001110011110111000111011111011111010001000111011011110000010000101111011111011110110110111111001101111010110000111111011111011111010000011111001101110011010111110011110101110100011100111101110001110111110111110100010001110110111100000100001011110111110111101101101111110011011110101100001111110111110111110100000111110011011100110101111100111101001000010 e8e7b8efbe88ede085efbdb7e6f587efbe83e6e6be7ae8e7b8efbe88ede085efbdb7e6f587efbe83e6e6be7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)