To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 慫?億?洙姐頃??烝?壓?┴姐頃??B 1001110011001111001111111000100110101101001111111001111110101010100010001011011110001101101000000011111100111111111000000111111000111111100110101101100000111111100001001010100010001000101101111000110110100000001111110011111101000010 9ccf3f89ad3f9faa88b78da03f3fe07e3f9ad83f84a888b78da03f3f42
EUC-JP 慫?億?洙姐頃??烝?壓?┴姐頃??B 1101100011010001001111111011001010101111001111111101111010101100101100001011100110111010101000100011111100111111110111111101111100111111110101001101101000111111101010001010101010110000101110011011101010100010001111110011111101000010 d8d13fb2af3fdeacb0b9baa23f3fdfdf3fd4da3fa8aab0b9baa23f3f42
UTF-8 慫렯億꿸洙姐頃렰렩烝렯壓뀀┴姐頃렰렩B 11100110100001011010101111101011101000001010111111100101100001001000010011101010101111111011100011100110101101001001100111100101101001111001000011101001101000001000001111101011101000001011000011101011101000001010100111100111100000111001110111101011101000001010111111100101101000111001001111101011100000001000000011100010100101001011010011100101101001111001000011101001101000001000001111101011101000001011000011101011101000001010100101000010 e685abeba0afe58484eabfb8e6b499e5a790e9a083eba0b0eba0a9e7839deba0afe5a393eb8080e294b4e5a790e9a083eba0b0eba0a942
UHC 慫렯億꿸洙姐頃렰렩烝렯壓뀀┴姐頃렰렩B 11110000111101101000111010111100111001011110001010110010111010101110001010101010111011101011101111001100111100011000111010111101100011101011011111110001111101101000111010111100111001001110001010110010111010111010011010101010111011101011101111001100111100011000111010111101100011101011011101000010 f0f68ebce5e2b2eae2aaeebbccf18ebd8eb7f1f68ebce4e2b2eba6aaeebbccf18ebd8eb742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)