To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繹??墻??歪??渦??埇???ラ?絶?? 11100011100010000011111100111111100110101101010000111111001111111001100001100011001111110011111110001001010100010011111100111111111110101001101000111111001111110011111110000011100010010011111110010000111000100011111100111111 e3883f3f9ad43f3f98633f3f89513f3ffa9a3f3f3f83893f90e23f3f
EUC-JP 繹??墻??歪??渦??埇???ラ?絶?? 1110010111101000001111110011111111010100110101100011111100111111110011111100010000111111001111111011000110110010001111110011111110001111101101111110011100111111001111110011111110100101111010010011111111000000111001000011111100111111 e5e83f3fd4d63f3fcfc43f3fb1b23f3f8fb7e73f3f3fa5e93fc0e43f3f
UTF-8 繹욕ㅁ墻듸쉼歪뉛쉭渦욇눑埇욥콊樂ラ땿絶볩슬 111001111011100110111001111011001001101010010101111000111000010110000001111001011010001010111011111010111001001110111000111011001000100110111100111001101010110110101010111010111000100110011011111011001000100110101101111001101011100010100110111011001001101010000111111010111000100010010001111001011001111110000111111011001001101010100101111011001011110110001010111011111010011010111111111000111000001110101001111010111001010110111111111001111011010110110110111010111011001110101001111011001000101010101100 e7b9b9ec9a95e38581e5a2bbeb93b8ec89bce6adaaeb899bec89ade6b8a6ec9a87eb8891e59f87ec9aa5ecbd8aefa6bfe383a9eb95bfe7b5b6ebb3a9ec8aac
UHC 繹욕ㅁ墻듸쉼歪뉛쉭渦욇눑埇욥콊樂ラ땿絶볩슬 111001101011101010111111111001011010010010110001111011011101111110110101111011111011110110110000111010001110000010000111111011111011110110101101111010001011111010011110111010011000011110101101111010011011100110111111111010011011000110000110111010001111100110101011111010011000101110010101111011111011111010010011111011111011110110111101 e6babfe5a4b1eddfb5efbdb0e8e087efbdade8be9ee987ade9b9bfe9b186e8f9abe98b95efbe93efbdbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)