To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???霓??幽??[???霓??幽??[^ 00111111001111110011111111101000101111010011111100111111100101110100100000111111001111110101101100111111001111110011111111101000101111010011111100111111100101110100100000111111001111110101101101011110 3f3f3fe8bd3f3f97483f3f5b3f3f3fe8bd3f3f97483f3f5b5e
EUC-JP ???霓??幽??[???霓??幽??[^ 00111111001111110011111111110000101111110011111100111111110011011010100100111111001111110101101100111111001111110011111111110000101111110011111100111111110011011010100100111111001111110101101101011110 3f3f3ff0bf3f3fcda93f3f5b3f3f3ff0bf3f3fcda93f3f5b5e
UTF-8 殮쏅뜐霓양썪幽볢돻[殮쏅뜐霓양썪幽볢돻[^ 111011111010011010100101111011001000111110000101111010111001110010010000111010011001110010010011111011001001011010010001111011001000110110101010111001011011100110111101111010111011001110100010111010111000111110111011010110111110111110100110101001011110110010001111100001011110101110011100100100001110100110011100100100111110110010010110100100011110110010001101101010101110010110111001101111011110101110110011101000101110101110001111101110110101101101011110 efa6a5ec8f85eb9c90e99c93ec9691ec8daae5b9bdebb3a2eb8fbb5befa6a5ec8f85eb9c90e99c93ec9691ec8daae5b9bdebb3a2eb8fbb5b5e
UHC 殮쏅뜐霓양썪幽볢돻[殮쏅뜐霓양썪幽볢돻[^ 111001101111100110011011111010111000110110010011111001111110011110111110111001111001101110011011111010101110101110010011111010001000100110111110010110111110011011111001100110111110101110001101100100111110011111100111101111101110011110011011100110111110101011101011100100111110100010001001101111100101101101011110 e6f99beb8d93e7e7bee79b9beaeb93e889be5be6f99beb8d93e7e7bee79b9beaeb93e889be5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)