To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 若??肄?????[若??肄?????[^ 10001110111000010011111100111111111000111110010100111111001111110011111100111111001111110101101110001110111000010011111100111111111000111110010100111111001111110011111100111111001111110101101101011110 8ee13f3fe3e53f3f3f3f3f5b8ee13f3fe3e53f3f3f3f3f5b5e
EUC-JP 若??肄?????[若??肄?????[^ 10111100111000110011111100111111111001101110011100111111001111110011111100111111001111110101101110111100111000110011111100111111111001101110011100111111001111110011111100111111001111110101101101011110 bce33f3fe6e73f3f3f3f3f5bbce33f3fe6e73f3f3f3f3f5b5e
UTF-8 若뽧끃肄듿킊栒뱀퐡[若뽧끃肄듿킊栒뱀퐡[^ 111010001000101110100101111010111011110110100111111010111000000110000011111010001000001010000100111010111001001110111111111011011000001010001010111001101010000010010010111010111011000110000000111011011001000010100001010110111110100010001011101001011110101110111101101001111110101110000001100000111110100010000010100001001110101110010011101111111110110110000010100010101110011010100000100100101110101110110001100000001110110110010000101000010101101101011110 e88ba5ebbda7eb8183e88284eb93bfed828ae6a092ebb180ed90a15be88ba5ebbda7eb8183e88284eb93bfed828ae6a092ebb180ed90a15b5e
UHC 若뽧끃肄듿킊栒뱀퐡[若뽧끃肄듿킊栒뱀퐡[^ 111001011011010010010110111000111000010110111001111011001011110110001010111001011011010010010110111000101110001110111001111011001011110110001010010110111110010110110100100101101110001110000101101110011110110010111101100010101110010110110100100101101110001011100011101110011110110010111101100010100101101101011110 e5b496e385b9ecbd8ae5b496e2e3b9ecbd8a5be5b496e385b9ecbd8ae5b496e2e3b9ecbd8a5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)