To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弔?垣兢??縡?怨峰?弔?垣兢??縡?怨峰?B 1001001010100010001111111000101001011111100110010101110100111111001111111110001101110001001111111000100110000101100101011111010000111111100100101010001000111111100010100101111110011001010111010011111100111111111000110111000100111111100010011000010110010101111101000011111101000010 92a23f8a5f995d3f3fe3713f898595f43f92a23f8a5f995d3f3fe3713f898595f43f42
EUC-JP 弔?垣兢??縡?怨峰?弔?垣兢??縡?怨峰?B 1100010010100100001111111011001111000000110100011011111000111111001111111110010111010010001111111011000111100101110010101111011000111111110001001010010000111111101100111100000011010001101111100011111100111111111001011101001000111111101100011110010111001010111101100011111101000010 c4a43fb3c0d1be3f3fe5d23fb1e5caf63fc4a43fb3c0d1be3f3fe5d23fb1e5caf63f42
UTF-8 弔렟垣兢렚렖縡렕怨峰썬弔렟垣兢렚렖縡렕怨峰썬B 11100101101111001001010011101011101000001001111111100101100111101010001111100101100001011010001011101011101000001001101011101011101000001001011011100111101110001010000111101011101000001001010111100110100000001010100011100101101100111011000011101100100011011010110011100101101111001001010011101011101000001001111111100101100111101010001111100101100001011010001011101011101000001001101011101011101000001001011011100111101110001010000111101011101000001001010111100110100000001010100011100101101100111011000011101100100011011010110001000010 e5bc94eba09fe59ea3e585a2eba09aeba096e7b8a1eba095e680a8e5b3b0ec8dace5bc94eba09fe59ea3e585a2eba09aeba096e7b8a1eba095e680a8e5b3b0ec8dac42
UHC 弔렟垣兢렚렖縡렕怨峰썬弔렟垣兢렚렖縡렕怨峰썬B 111100001100000010001110101100001110101010101111110100001110011110001110101011011000111010101011111011101010110110001110101010101110101010110011110111001110100010111101111000111111000011000000100011101011000011101010101011111101000011100111100011101010110110001110101010111110111010101101100011101010101011101010101100111101110011101000101111011110001101000010 f0c08eb0eaafd0e78ead8eabeead8eaaeab3dce8bde3f0c08eb0eaafd0e78ead8eabeead8eaaeab3dce8bde342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)