What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???\??????????\??????? 00111111001111110011111101011100001111110011111100111111001111110011111100111111001111110011111100111111001111110101110000111111001111110011111100111111001111110011111100111111 3f3f3f5c3f3f3f3f3f3f3f3f3f3f5c3f3f3f3f3f3f3f
SJIS-WIN テδィ\テつ古つ湘つ洲テδィ\テつ催つ湘つ住 110000111000001111000010101010000101110011000011100000101100001010001100110000111000001011000010100011111100001110000010110000101000111101000110110000111000001111000010101010000101110011000011100000101100001010001101110000111000001011000010100011111100001110000010110000101000111101011010 c383c2a85cc382c28cc382c28fc382c28f46c383c2a85cc382c28dc382c28fc382c28f5a
EUC-JP テδィ\テつ古つ湘つ洲テδィ\テつ催つ湘つ住 100011101100001110100110110001001000111010101000010111001000111011000011101001001100010010111000110001011010010011000100101111101100010110100100110001001011110110100111100011101100001110100110110001001000111010101000010111001000111011000011101001001100010010111010110001011010010011000100101111101100010110100100110001001011110110111011 8ec3a6c48ea85c8ec3a4c4b8c5a4c4bec5a4c4bda78ec3a6c48ea85c8ec3a4c4bac5a4c4bec5a4c4bdbb
UTF-8 テδィ\テつ古つ湘つ洲テδィ\テつ催つ湘つ住 111011111011111010000011110011101011010011101111101111011010100001011100111011111011111010000011111000111000000110100100111001011000111110100100111000111000000110100100111001101011100110011000111000111000000110100100111001101011010010110010111011111011111010000011110011101011010011101111101111011010100001011100111011111011111010000011111000111000000110100100111001011000001010101100111000111000000110100100111001101011100110011000111000111000000110100100111001001011110110001111 efbe83ceb4efbda85cefbe83e381a4e58fa4e381a4e6b998e381a4e6b4b2efbe83ceb4efbda85cefbe83e381a4e582ace381a4e6b998e381a4e4bd8f
UHC ?δ?\?つ古つ湘つ洲?δ?\?つ催つ湘つ住 001111111010010111100100001111110101110000111111101010101100010011001101101011111010101011000100110111111100111110101010110001001111000110111101001111111010010111100100001111110101110000111111101010101100010011110101110010101010101011000100110111111100111110101010110001001111000110101100 3fa5e43f5c3faac4cdafaac4dfcfaac4f1bd3fa5e43f5c3faac4f5caaac4dfcfaac4f1ac

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
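
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed stand-ins for the charsets listed above, and the helper name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex 3f), which appears to be what the table does; Python's codec tables may differ from the page's in edge cases.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # Windows variant of Shift_JIS
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_table(text):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("あ"):
        print(label, shown, bits, hexstr)
```

Each byte is printed as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.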