To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堤?町?醫狡??町?醫?堤?町?醫狡??町?醫 1001001011100111001111111001001010101100001111111110011111001110111000001100001000111111001111111001001010101100001111111110011111001110001111111001001011100111001111111001001010101100001111111110011111001110111000001100001000111111001111111001001010101100001111111110011111001110 92e73f92ac3fe7cee0c23f3f92ac3fe7ce3f92e73f92ac3fe7cee0c23f3f92ac3fe7ce
EUC-JP 堤?町?醫狡??町?醫苽堤?町?醫狡??町?醫 11000100111010010011111111000100101011100011111111101110110100001110000011000100001111110011111111000100101011100011111111101110110100001000111111010111110110111100010011101001001111111100010010101110001111111110111011010000111000001100010000111111001111111100010010101110001111111110111011010000 c4e93fc4ae3feed0e0c43f3fc4ae3feed08fd7dbc4e93fc4ae3feed0e0c43f3fc4ae3feed0
UTF-8 堤렚町렑醫狡렏렚町렑醫苽堤렚町렑醫狡렏렚町렑醫 111001011010000010100100111010111010000010011010111001111001010010111010111010111010000010010001111010011000011010101011111001111000101110100001111010111010000010001111111010111010000010011010111001111001010010111010111010111010000010010001111010011000011010101011111010001000101110111101111001011010000010100100111010111010000010011010111001111001010010111010111010111010000010010001111010011000011010101011111001111000101110100001111010111010000010001111111010111010000010011010111001111001010010111010111010111010000010010001111010011000011010101011 e5a0a4eba09ae794baeba091e986abe78ba1eba08feba09ae794baeba091e986abe88bbde5a0a4eba09ae794baeba091e986abe78ba1eba08feba09ae794baeba091e986ab
UHC 堤렚町렑醫狡렏렚町렑醫苽堤렚町렑醫狡렏렚町렑醫 11110000101001111000111010101101111011111110101110001110101001101110110010100010110011101110101010001110101001011000111010101101111011111110101110001110101001101110110010100010110011011100100111110000101001111000111010101101111011111110101110001110101001101110110010100010110011101110101010001110101001011000111010101101111011111110101110001110101001101110110010100010 f0a78eadefeb8ea6eca2ceea8ea58eadefeb8ea6eca2cdc9f0a78eadefeb8ea6eca2ceea8ea58eadefeb8ea6eca2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)