To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???G?????????G??????B 001111110011111100111111010001110011111100111111001111110011111100111111001111110011111100111111001111110100011100111111001111110011111100111111001111110011111101000010 3f3f3f473f3f3f3f3f3f3f3f3f473f3f3f3f3f3f42
SJIS-WIN ??縟G??邵??堯??縟G??邵??堯B 001111110011111111100011011101000100011100111111001111111110011110111000001111110011111111101010100111110011111100111111111000110111010001000111001111110011111111100111101110000011111100111111111010101001111101000010 3f3fe374473f3fe7b83f3fea9f3f3fe374473f3fe7b83f3fea9f42
EUC-JP ??縟G??邵??堯??縟G??邵??堯B 001111110011111111100101110101010100011100111111001111111110111010111010001111110011111111110100101000010011111100111111111001011101010101000111001111110011111111101110101110100011111100111111111101001010000101000010 3f3fe5d5473f3feeba3f3ff4a13f3fe5d5473f3feeba3f3ff4a142
UTF-8 쐛숰縟G쐛숰邵쐛了堯쐛숰縟G쐛숰邵쐛了堯B 111011001001000010011011111011001000100010110000111001111011100010011111010001111110110010010000100110111110110010001000101100001110100110000010101101011110110010010000100110111110111110100110101110101110010110100000101011111110110010010000100110111110110010001000101100001110011110111000100111110100011111101100100100001001101111101100100010001011000011101001100000101011010111101100100100001001101111101111101001101011101011100101101000001010111101000010 ec909bec88b0e7b89f47ec909bec88b0e982b5ec909befa6bae5a0afec909bec88b0e7b89f47ec909bec88b0e982b5ec909befa6bae5a0af42
UHC 쐛숰縟G쐛숰邵쐛了堯쐛숰縟G쐛숰邵쐛了堯B 100111001000000110011010010010001110100110110010010001111001110010000001100110100100100011100001110100001001110010000001111010001110011111101000111010111001110010000001100110100100100011101001101100100100011110011100100000011001101001001000111000011101000010011100100000011110100011100111111010001110101101000010 9c819a48e9b2479c819a48e1d09c81e8e7e8eb9c819a48e9b2479c819a48e1d09c81e8e7e8eb42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
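
The conversion shown in the table can be approximated with a short Python sketch. It is not this page's actual implementation. The names `CHARSETS`, `encode_row` and `encode_table` are hypothetical, and the codec names are Python's own (`cp932` for SJIS-Win, `cp949` for UHC). Replacing characters a charset cannot represent with "?" (0x3f) is an assumption, chosen because it matches the 3f bytes in the ISO-8859-1 row.

```python
# Map the charset labels used in the table to Python codec names.
# The mapping is an assumption about equivalent codecs, not the tool's own code.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("G").items():
        print(label, bits, hexstr)
```

For an ASCII letter such as "G", every charset listed here yields the same single byte, 47 in hexadecimal, which is why 47 appears in each row of the table. The rows differ only for the non-ASCII characters.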