What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??疑?材稽??疑?材界??疑?旭?? 001111110011111110001011010111100011111110001101110111101000110001101101001111110011111110001011010111100011111110001101110111101000101001000101001111110011111110001011010111100011111110001000101011100011111100111111 3f3f8b5e3f8dde8c6d3f3f8b5e3f8dde8a453f3f8b5e3f88ae3f3f
EUC-JP ??疑?材稽??疑?材界??疑?旭?? 001111110011111110110101101111110011111110111010111000001011011111001110001111110011111110110101101111110011111110111010111000001011001110100110001111110011111110110101101111110011111110110000101100000011111100111111 3f3fb5bf3fbae0b7ce3f3fb5bf3fbae0b3a63f3fb5bf3fb0b03f3f
UTF-8 欌렪疑렑材稽欌렪疑렑材界欌렪疑렑旭흗떵 111001101010110010001100111010111010000010101010111001111001011010010001111010111010000010010001111001101001110110010000111001111010100010111101111001101010110010001100111010111010000010101010111001111001011010010001111010111010000010010001111001101001110110010000111001111001010110001100111001101010110010001100111010111010000010101010111001111001011010010001111010111010000010010001111001101001011110101101111011011001110110010111111010111001011010110101 e6ac8ceba0aae79691eba091e69d90e7a8bde6ac8ceba0aae79691eba091e69d90e7958ce6ac8ceba0aae79691eba091e697aded9d97eb96b5
UHC 欌렪疑렑材稽欌렪疑렑材界欌렪疑렑旭흗떵 1110110111101011100011101011100011101011111101111000111010100110111011101010011111001101101001101110110111101011100011101011100011101011111101111000111010100110111011101010011111001101101000111110110111101011100011101011100011101011111101111000111010100110111010011110111111001000111010011011011010111010 edeb8eb8ebf78ea6eea7cda6edeb8eb8ebf78ea6eea7cda3edeb8eb8ebf78ea6e9efc8e9b6ba

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f).
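
The rows of the table can be approximated with a few lines of Python. This is only a sketch, not the converter this page actually uses: the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Unencodable characters are replaced with "?" (0x3f), which matches the behaviour seen in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Return a list of (label, bits, hex) rows, one per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row(text, codec)
        rows.append((label, bits, hexed))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_table("疑"):
        print(label, bits, hexed)
```

For the single character 疑 this yields e79691 in UTF-8, 8b5e in SJIS-WIN, and b5bf in EUC-JP, the same byte values that appear for that character in the table above. Results for other characters, particularly in UHC, may differ slightly from the page depending on the codec tables used.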