To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN テつ鱈テつ炭テつ巽テつ端テつ淡テつ妥つ「B 11000011100000101100001010010010010011001100001110000010110000101001001001011001110000111000001011000010100100100100011011000011100000101100001010010010010110111100001110000010110000101001001001010111110000111000001011000010100100011100001110000010110000101010001001000010 c382c2924cc382c29259c382c29246c382c2925bc382c29257c382c291c382c2a242
EUC-JP テつ鱈テつ炭テつ巽テつ端テつ淡テつ妥つ「B 1000111011000011101001001100010011000011101011011000111011000011101001001100010011000011101110101000111011000011101001001100010011000011101001111000111011000011101001001100010011000011101111001000111011000011101001001100010011000011101110001000111011000011101001001100010011000010110001011010010011000100100011101010001001000010 8ec3a4c4c3ad8ec3a4c4c3ba8ec3a4c4c3a78ec3a4c4c3bc8ec3a4c4c3b88ec3a4c4c2c5a4c48ea242
UTF-8 テつ鱈テつ炭テつ巽テつ端テつ淡テつ妥つ「B 11101111101111101000001111100011100000011010010011101001101100011000100011101111101111101000001111100011100000011010010011100111100000101010110111101111101111101000001111100011100000011010010011100101101101111011110111101111101111101000001111100011100000011010010011100111101010111010111111101111101111101000001111100011100000011010010011100110101101111010000111101111101111101000001111100011100000011010010011100101101001101010010111100011100000011010010011101111101111011010001001000010 efbe83e381a4e9b188efbe83e381a4e782adefbe83e381a4e5b7bdefbe83e381a4e7abafefbe83e381a4e6b7a1efbe83e381a4e5a6a5e381a4efbda242
UHC ?つ??つ炭?つ巽?つ端?つ淡?つ妥つ?B 001111111010101011000100001111110011111110101010110001001111011110101001001111111010101011000100111000011101111000111111101010101100010011010011101011100011111110101010110001001101001110111111001111111010101011000100111101101110011010101010110001000011111101000010 3faac43f3faac4f7a93faac4e1de3faac4d3ae3faac4d3bf3faac4f6e6aac43f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)