To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN テつ竪テつ辰テつ形iテつ竪テつ辰テつ形iB 110000111000001011000010100100100100011111000011100000101100001010010010010000111100001110000010110000101000110001100000011010011100001110000010110000101001001001000111110000111000001011000010100100100100001111000011100000101100001010001100011000000110100101000010 c382c29247c382c29243c382c28c6069c382c29247c382c29243c382c28c606942
EUC-JP テつ竪テつ辰テつ形iテつ竪テつ辰テつ形iB 100011101100001110100100110001001100001110101000100011101100001110100100110001001100001110100100100011101100001110100100110001001011011111000001011010011000111011000011101001001100010011000011101010001000111011000011101001001100010011000011101001001000111011000011101001001100010010110111110000010110100101000010 8ec3a4c4c3a88ec3a4c4c3a48ec3a4c4b7c1698ec3a4c4c3a88ec3a4c4c3a48ec3a4c4b7c16942
UTF-8 テつ竪テつ辰テつ形iテつ竪テつ辰テつ形iB 111011111011111010000011111000111000000110100100111001111010101110101010111011111011111010000011111000111000000110100100111010001011111010110000111011111011111010000011111000111000000110100100111001011011110110100010011010011110111110111110100000111110001110000001101001001110011110101011101010101110111110111110100000111110001110000001101001001110100010111110101100001110111110111110100000111110001110000001101001001110010110111101101000100110100101000010 efbe83e381a4e7abaaefbe83e381a4e8beb0efbe83e381a4e5bda269efbe83e381a4e7abaaefbe83e381a4e8beb0efbe83e381a4e5bda26942
UHC ?つ竪?つ辰?つ形i?つ竪?つ辰?つ形iB 001111111010101011000100111000101011010100111111101010101100010011110010111000110011111110101010110001001111101110100001011010010011111110101010110001001110001010110101001111111010101011000100111100101110001100111111101010101100010011111011101000010110100101000010 3faac4e2b53faac4f2e33faac4fba1693faac4e2b53faac4f2e33faac4fba16942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)