To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????­???????????????^ 00111111001111110011111100111111001111111010110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3fad3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鶯??節??鸚??疫?????形?????^ 111010011111001000111111001111111001000011011111001111110011111111101010010111110011111100111111100010010111010100111111001111110011111100111111001111111000110001100000001111110011111100111111001111110011111101011110 e9f23f3f90df3f3fea5f3f3f89753f3f3f3f3f8c603f3f3f3f3f5e
EUC-JP 鶯??節??鸚??疫??渶??形?????^ 1111001011110100001111110011111111000000111000010011111100111111111100111100000000111111001111111011000111010110001111110011111110001111110001111110110100111111001111111011011111000001001111110011111100111111001111110011111101011110 f2f43f3fc0e13f3ff3c03f3fb1d63f3f8fc7ed3f3fb7c13f3f3f3f3f5e
UTF-8 鶯뚨퇅節뫈­鸚긺댌疫욕즺渶뽳슛形⒴ㅁ連득뮈^ 111010011011011010101111111010111001101010101000111011011000011110000101111001111010111110000000111010111010101110001000110000101010110111101001101110001001101011101010101110001011101011101011100011001000110011100111100101101010101111101100100110101001010111101100101001101011101011100110101110001011011011101011101111011011001111101100100010101001101111100101101111011010001011100010100100101011010011100011100001011000000111101111101001101001101011101011100100111001110111101011101011101000100001011110 e9b6afeb9aa8ed8785e7af80ebab88c2ade9b89aeab8baeb8c8ce796abec9a95eca6bae6b8b6ebbdb3ec8a9be5bda2e292b4e38581efa69aeb939debae885e
UHC 鶯뚨퇅節뫈­鸚긺댌疫욕즺渶뽳슛形⒴ㅁ連득뮈^ 11100101101000111000110011100111101101111001011011101111101111011011100011111011101000011010100111100101101001001011000111100111100010001011010111100110101110011011111111100101101000111000110011100111101101111001011011101111101111011011100011111011101000011010100111100101101001001011000111100110111001101011010111100110101110011011111101011110 e5a38ce7b796efbdb8fba1a9e5a4b1e788b5e6b9bfe5a38ce7b796efbdb8fba1a9e5a4b1e6e6b5e6b9bf5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
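
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the charset labels to Python codec names (in particular "cp932" for SJIS-WIN and "cp949" for UHC) and the helper name to_bit_strings are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be how the rows above were produced.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "^"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Calling to_bit_strings with a different string and codec name gives the binary and hexadecimal columns for that pair. Results for legacy Japanese and Korean charsets may differ slightly from this page if the underlying conversion tables are not identical.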