To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 悟??裕??循??[悟??裕??循??[^ 100011001110010100111111001111111001011101010100001111110011111110001111011110100011111100111111010110111000110011100101001111110011111110010111010101000011111100111111100011110111101000111111001111110101101101011110 8ce53f3f97543f3f8f7a3f3f5b8ce53f3f97543f3f8f7a3f3f5b5e
EUC-JP 悟??裕??循??[悟??裕??循??[^ 101110001110011100111111001111111100110110110101001111110011111110111101110110110011111100111111010110111011100011100111001111110011111111001101101101010011111100111111101111011101101100111111001111110101101101011110 b8e73f3fcdb53f3fbddb3f3f5bb8e73f3fcdb53f3fbddb3f3f5b5e
UTF-8 悟듽굠裕낁삃循덈츉[悟듽굠裕낁삃循덈츉[^ 111001101000001010011111111010111001001110111101111010101011010110100000111010001010001110010101111010111000001010000001111011001000001010000011111001011011111010101010111010111000110110001000111011001011100010001001010110111110011010000010100111111110101110010011101111011110101010110101101000001110100010100011100101011110101110000010100000011110110010000010100000111110010110111110101010101110101110001101100010001110110010111000100010010101101101011110 e6829feb93bdeab5a0e8a395eb8281ec8283e5beaaeb8d88ecb8895be6829feb93bdeab5a0e8a395eb8281ec8283e5beaaeb8d88ecb8895b5e
UHC 悟듽굠裕낁삃循덈츉[悟듽굠裕낁삃循덈츉[^ 111001111111011010001010111000111000001010001000111010111010111010000101111010001001100010001010111000101110000010001000111010111010111010000101010110111110011111110110100010101110001110000010100010001110101110101110100001011110100010011000100010101110001011100000100010001110101110101110100001010101101101011110 e7f68ae38288ebae85e8988ae2e088ebae855be7f68ae38288ebae85e8988ae2e088ebae855b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)