To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 譽??央?ぐ娃??D譽??央?ぐ娃??D^ 1110011010100011001111110011111110001001100110110011111110000010101011101000100010100001001111110011111101000100111001101010001100111111001111111000100110011011001111111000001010101110100010001010000100111111001111110100010001011110 e6a33f3f899b3f82ae88a13f3f44e6a33f3f899b3f82ae88a13f3f445e
EUC-JP 譽??央?ぐ娃??D譽??央?ぐ娃??D^ 1110110010100101001111110011111110110001111110110011111110100100101100001011000010100011001111110011111101000100111011001010010100111111001111111011000111111011001111111010010010110000101100001010001100111111001111110100010001011110 eca53f3fb1fb3fa4b0b0a33f3f44eca53f3fb1fb3fa4b0b0a33f3f445e
UTF-8 譽긷춼央뉓ぐ娃쒑뇮D譽긷춼央뉓ぐ娃쒑뇮D^ 111010001010110110111101111010101011100010110111111011001011011010111100111001011010010010101110111010111000100110010011111000111000000110010000111001011010100010000011111011001001001010010001111010111000011110101110010001001110100010101101101111011110101010111000101101111110110010110110101111001110010110100100101011101110101110001001100100111110001110000001100100001110010110101000100000111110110010010010100100011110101110000111101011100100010001011110 e8adbdeab8b7ecb6bce5a4aeeb8993e38190e5a883ec9291eb87ae44e8adbdeab8b7ecb6bce5a4aeeb8993e38190e5a883ec9291eb87ae445e
UHC 譽긷춼央뉓ぐ娃쒑뇮D譽긷춼央뉓ぐ娃쒑뇮D^ 111001111110001010110001111001011010110110011000111001001110011110000111111010001010101010110000111010001101111110011100111010001000011110010011010001001110011111100010101100011110010110101101100110001110010011100111100001111110100010101010101100001110100011011111100111001110100010000111100100110100010001011110 e7e2b1e5ad98e4e787e8aab0e8df9ce8879344e7e2b1e5ad98e4e787e8aab0e8df9ce88793445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)