What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?G??nf?G??n^}Y?G??nf?G??n^}bE 0011111101000111001111110011111101101110011001100011111101000111001111110011111101101110010111100111110101011001001111110100011100111111001111110110111001100110001111110100011100111111001111110110111001011110011111010110001001000101 3f473f3f6e663f473f3f6e5e7d593f473f3f6e663f473f3f6e5e7d6245
SJIS-WIN 哭G痞哈nf哭G痞哈n^}Y哭G痞哈nf哭G痞哈n^}bE 1001101001001100010001111110000101111100100110011111101101101110011001101001101001001100010001111110000101111100100110011111101101101110010111100111110101011001100110100100110001000111111000010111110010011001111110110110111001100110100110100100110001000111111000010111110010011001111110110110111001011110011111010110001001000101 9a4c47e17c99fb6e669a4c47e17c99fb6e5e7d599a4c47e17c99fb6e669a4c47e17c99fb6e5e7d6245
EUC-JP 哭G痞哈nf哭G痞哈n^}Y哭G痞哈nf哭G痞哈n^}bE 1101001110101101010001111110000111011101110100101111110101101110011001101101001110101101010001111110000111011101110100101111110101101110010111100111110101011001110100111010110101000111111000011101110111010010111111010110111001100110110100111010110101000111111000011101110111010010111111010110111001011110011111010110001001000101 d3ad47e1ddd2fd6e66d3ad47e1ddd2fd6e5e7d59d3ad47e1ddd2fd6e66d3ad47e1ddd2fd6e5e7d6245
UTF-8 哭G痞哈nf哭G痞哈n^}Y哭G痞哈nf哭G痞哈n^}bE 1110010110010011101011010100011111100111100101111001111011100101100100111000100001101110011001101110010110010011101011010100011111100111100101111001111011100101100100111000100001101110010111100111110101011001111001011001001110101101010001111110011110010111100111101110010110010011100010000110111001100110111001011001001110101101010001111110011110010111100111101110010110010011100010000110111001011110011111010110001001000101 e593ad47e7979ee593886e66e593ad47e7979ee593886e5e7d59e593ad47e7979ee593886e66e593ad47e7979ee593886e5e7d6245
UHC 哭G?哈nf哭G?哈n^}Y哭G?哈nf哭G?哈n^}bE 11001101110101100100011100111111111110011110101101101110011001101100110111010110010001110011111111111001111010110110111001011110011111010101100111001101110101100100011100111111111110011110101101101110011001101100110111010110010001110011111111111001111010110110111001011110011111010110001001000101 cdd6473ff9eb6e66cdd6473ff9eb6e5e7d59cdd6473ff9eb6e66cdd6473ff9eb6e5e7d6245

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
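
The conversion shown in the table can be approximated with Python's built-in codecs. This is a minimal sketch, not the tool's actual implementation: the helper name `to_bits` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?`, which matches the `?` entries in the ISO-8859-1 and UHC rows.

```python
def to_bits(text, charset):
    """Encode text in charset and return (binary string, hex string).

    Characters the charset cannot represent are replaced with '?'.
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bits("哭G", codec)
        print(label, binary, hexadecimal)
```

For example, `to_bits("哭", "utf-8")` yields the hexadecimal string `e593ad`, which is the first three bytes of the UTF-8 row above.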