To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????L[?????????L[^ 0011111100111111001111110011111100111111001111110011111100111111001111110100110001011011001111110011111100111111001111110011111100111111001111110011111100111111010011000101101101011110 3f3f3f3f3f3f3f3f3f4c5b3f3f3f3f3f3f3f3f3f4c5b5e
SJIS-WIN ?????????L[?????????L[^ 0011111100111111001111110011111100111111001111110011111100111111001111110100110001011011001111110011111100111111001111110011111100111111001111110011111100111111010011000101101101011110 3f3f3f3f3f3f3f3f3f4c5b3f3f3f3f3f3f3f3f3f4c5b5e
EUC-JP ?????????L[?????????L[^ 0011111100111111001111110011111100111111001111110011111100111111001111110100110001011011001111110011111100111111001111110011111100111111001111110011111100111111010011000101101101011110 3f3f3f3f3f3f3f3f3f4c5b3f3f3f3f3f3f3f3f3f4c5b5e
UTF-8 챙짹혙챙짢혟챙짼짯L[챙짹혙챙짢혟챙짼짯L[^ 1110110010110001100110011110110010100111101110011110110110011000100110011110110010110001100110011110110010100111101000101110110110011000100111111110110010110001100110011110110010100111101111001110110010100111101011110100110001011011111011001011000110011001111011001010011110111001111011011001100010011001111011001011000110011001111011001010011110100010111011011001100010011111111011001011000110011001111011001010011110111100111011001010011110101111010011000101101101011110 ecb199eca7b9ed9899ecb199eca7a2ed989fecb199eca7bceca7af4c5becb199eca7b9ed9899ecb199eca7a2ed989fecb199eca7bceca7af4c5b5e
UHC 챙짹혙챙짢혟챙짼짯L[챙짹혙챙짢혟챙짼짯L[^ 1100001110101100110000101011000111000010100001001100001110101100110000101010100011000010100010011100001110101100110000101011001011000010101011010100110001011011110000111010110011000010101100011100001010000100110000111010110011000010101010001100001010001001110000111010110011000010101100101100001010101101010011000101101101011110 c3acc2b1c284c3acc2a8c289c3acc2b2c2ad4c5bc3acc2b1c284c3acc2a8c289c3acc2b2c2ad4c5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)