To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌Q??筌l?泣??擬??筌??泣??肯泣 11100010101000111000001001110000001111110011111111100010101000111000001010001100001111111000101110000011001111110011111110001011010110110011111100111111111000101010001100111111001111111000101110000011001111110011111110001101011011011000101110000011 e2a382703f3fe2a3828c3f8b833f3f8b5b3f3fe2a33f3f8b833f3f8d6d8b83
EUC-JP 筌Q??筌l?泣??擬??筌??泣??肯泣 11100100101001011010001111010001001111110011111111100100101001011010001111101100001111111011010111100011001111110011111110110101101111000011111100111111111001001010010100111111001111111011010111100011001111110011111110111001110011101011010111100011 e4a5a3d13f3fe4a5a3ec3fb5e33f3fb5bc3f3fe4a53f3fb5e33f3fb9ceb5e3
UTF-8 筌Q뗭뒆筌l쥚泣놅㎕擬좉퍍筌듐룊泣놅㎗肯泣 111001111010110110001100111011111011110010110001111010111001011110101101111010111001001010000110111001111010110110001100111011111011110110001100111011001010010110011010111001101011001110100011111010111000011010000101111000111000111010010101111001101001001110101100111011001010001010001001111011011000110110001101111001111010110110001100111010111001001110010000111010111010001110001010111001101011001110100011111010111000011010000101111000111000111010010111111010001000001010101111111001101011001110100011 e7ad8cefbcb1eb97adeb9286e7ad8cefbd8ceca59ae6b3a3eb8685e38e95e693aceca289ed8d8de7ad8ceb9390eba38ae6b3a3eb8685e38e97e882afe6b3a3
UHC 筌Q뗭뒆筌l쥚泣놅㎕擬좉퍍筌듐룊泣놅㎗肯泣 111011111010011110100011110100011000101111101100100010101000010011101111101001111010001111101100101000101000111111101011111010001000011011101111101001111010000111101011111101001010000011101010101110111000010011101111101001111011010111100011100011111000100111101011111010001000011011101111101001111010001111010000111010011110101111101000 efa7a3d18bec8a84efa7a3eca28febe886efa7a1ebf4a0eabb84efa7b5e38f89ebe886efa7a3d0e9ebe8

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
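
The conversion shown in the table can be approximated in a few lines of Python. This is only a sketch, not the converter's actual implementation: the helper name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Hypothetical helper: return (binary string, hex string) for text in codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

Running the script prints one line per charset in the same column order as the table: charset, character, binary bit string, hexadecimal bit string.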